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A NERVE GROWTH-STIMULATING FACTOR ISOLATED 
FROM SNAKE VENOM* 


By STANLEY COHEN AND Rita Levi-MonrTALCINI 
DEPARTMENT OF ZOOLOGY, WASHINGTON UNIVERSITY, ST. LOUIS, MISSOURI 
Communicated by V. Hamburge r, July 19, 1956 


Introduction.—In the past our efforts have been directed toward the purification 
of an agent contained in mouse sarcomas 180 and 37 which has remarkable growth- 
promoting effects on sympathetic and spinal ganglia of the chick embryo." * 
In the course of attempts to characterize this material, use was made of crude snake 
venom as a source of phosphodiesterase. Treatment of the sarcoma factor with 
snake venom enhanced its activity, and subsequent tests showed that snake 
venom alone contains a very potent growth-promoting agent. In fact, it can be 
shown that crude venom is approximately 3,000-6,000 times as active as crude 
tumor homogenates (on a dry-weight basis) in promoting nerve fiber outgrowth in 
spinal ganglia in vitro. In the following, we are presenting the results of a partial 
purification and characterization of the snake-venom factor. 

Materials and Methods.—Dried moccasin snake venom (Agkistrodon piscivorus) 
was obtained from the Ross Allen Reptile Institute. For the assay of the nerve 
growth-promoting activity, hanging-drop tissue cultures were made containing 
plasma (rooster), synthetic medium 1066 (with thrombin), and the material to be 
tested. The details of the procedure have been presented elsewhere.* Each 
culture contained three or four sensory ganglia isolated from 7-day chick embryos. 


lA) 


The cultures were observed after 18 hours of incubation at 37°, and the growth of 
the fibers was semiquantitatively recorded from 1+ to 4+ (see Figs. 1-4 of our 
previous publication’). The assay was sensitive to twofold changes in concentra- 
tion of the active material; smaller changes were not detectable. 

Experimental Results.—The effect of crude venom when it is added to the cul- 
tures ata concentration of 7 ug/mlisshown in Figures | and2. The active component 
of the venom was nondialyzable. Solutions of the venom (30 ug/ml in distilled 
water) were then treated with alkali (0.1 N sodium hydroxide for 1 hour at 26°), 
acid (0.1 N hydrochloric acid for 1 hour at 26°), 6 NV urea (1 hour at 0°), and heat 
(5 minutes at 90°) in a manner identical to that described elsewhere’ for the tumor 
factor. The biological activity was stable to urea and alkali and was completely 
destroyed by the heat and acid treatments. This behavior is similar to that shown 
by the “protein fraction” isolated from sarcoma 180. 

In exploratory experiments it was found that the effect of the venom was abolished 
by the anti-snake-venom serum.‘ In these experiments a mixture of venom (20 
ug/ml) and antiserum (40 ug/ml) was allowed to stand for 1 hour at 26°. Aliquots 
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were then tested for biological activity. The inhibitory effect of the antiserum sug- 
gests that the active factor is associated with one of the protein components of the 


venom. 
When the solution of the crude venom was allowed to stand overnight at 20 
approximately three-fourths of the activity was destroyed. However, if the crude 


venom Was treated with 6 NV urea, its stability was increased, and only slight loss of 
activity (less than 25 per cent) was noted after standing for 18 hours at 20°. This 
may be due to the destruction by the urea of an inactivating enzyme in the venom. 
Advantage of this was taken during the following purification procedure. 

One gram of venom (A. piscivorus) and 36 gm. of urea were dissolved in 80 ml. of 
water containing 3 ml. of 0.1 N NaOH. The solution was allowed to stand for 90 
minutes at 0°. All subsequent procedures were carried out at temperatures from 
0° to 3°. Saturated ammonium sulfate (at 0°, pH 7.6) was added to the solution 
until a final concentration of 48 per cent saturation was obtained. The mixture 
was allowed to stand for 15 minutes and was then centrifuged for five minutes at 
16,000 x g. The precipitate was discarded. To the supernatant fluid am- 
monium sulphate was added to a final concentration of 71 per cent saturation. 
The mixture was again allowed to stand for 15 minutes and then was centrifuged. 
The yellowish supernatant fluid was discarded. The precipitate, containing the ac- 
tive factor, was dissolved in 40 ml. of water and dialyzed against distilled water 
overnight. The material was then centrifuged for 10 minutes at 16,000 x g, and 
the slight precipitate was discarded. The supernatant fluid was again fractionated 
with the above saturated ammonium sulfate solution (in the absence of urea); the 
active material precipitated between 46 and 59 per cent saturation. After centrif- 
ugation the precipitate was dissolved in 20 ml. of water and dialyzed overnight 
against distilled water. This material may be stored in the frozen condition for at 
least several weeks with no detectable loss in activity. Ninety-five milligrams (dry 
weight) and approximately 40 per cent of the original total activity was recovered; 
a fourfold purification was thus achieved. 

The biological activity of this material is more stable to the denaturing effects 
of the organic solvents than was the factor purified from sarcoma 180. ‘To solutions 
containing 30 ug/ml were added ethanol, methanol, and acetone, to a final con- 
centration of 66 per cent. The mixtures were allowed to stand for 3 hours at 30°. 
The organic solvents were removed by vacuum distillation at 3° and dialysis. Over 
50 per cent of the original activity was retained. Similar treatment almost com- 
pletely inactivated the tumor factor.? 

The 280/260 my absorption ratio in distilled water, pH 7.0, was 1.75. The addi- 
tion of streptomycin (0.01 M) caused no visible precipitation. These data indicate 
that only traces, if any, of nucleic acids were present. One milligram of this mate- 
rial was equivalent to 1.5 mg. of protein as determined by the procedure of Lowry 
et al.,® using bovine albumin as a standard; subsequent fractionations were based 
on protein content. 

The venom factor was then further purified by adsorption and elution from the 
anionic and cationic cellulose resins DEAE-SF (1.1 meq/gm) and CM-cellulose 
(0.5 meq/gm).® Ten milliliters of the ammonium sulfate fraction (containing 3.3 
mg. of protein per milliliter) were passed through a column 1.4 em. in diameter 
containing 500 mg. of the DEAE-SF at a flow rate of 0.1-0.2 mi/minute. The 
column. was then washed with 10 ml. of distilled water. Only traces of protein and 





Fics. 1-4.—-Microphotographs of living sensory ganglia (from 7-day chick embryos) after 18 
hours of incubation at 37°. Fig. 1, the standard control medium was used. Fig. 2, the medium 
contained 7 ug/ml of crude snake venom. Fig. 3, the medium contained 0.3 ug/ml of the CM frae- 
iion obtained from the venom. Fig. 4. The medium contained 250 ug/ml of the “protein frac- 
tion”’ purified from sarcoma 180. 
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no activity were present in this combined fraction. The active material was then 
eluted with 20 ml. of 0.01 WZ sodium chloride in 0.01 7 potassium phosphate buffer, 
pH 7.4. This fraction (DEAE fraction) contained 15 mg. of protein and almost all 
(over 75 per cent) of the biological activity. The 18 mg. of protein which re- 
mained on the column could be quantitatively eluted with 20 ml. of 0.5 M sodium 
chloride in the phosphate buffer. This fraction was biologically inactive. 

The active fraction was then dialyzed with stirring for 5 hours against distilled 
water, and 10 ml. containing a total of 7 mg. of protein were applied to a column 
1.4 cm. in diameter containing 500 mg. of CM-cellulose at a flow rate of 0.1-0.2 
ml/minute. The column was then washed with 10 ml. of water. This combined 
fraction contained 4.1 mg. of protein and was inactive. The activity was then 
eluted with 10 ml. of 0.1 M sodium chloride in 0.01 M potassium phosphate buffer, 
pH 7.4. This fraction (CM fraction) contained 1.9 mg. of protein and again almost 
all the activity (over 75 per cent). The remaining 1 mg. of protein could be eluted 
with 0.5 MW sodium chloride in phosphate buffer and was inactive. 

To summarize: approximately 25 per cent of the activity present in 1 gm. of the 
crude venom was recovered in 11.7 mg. of a “‘protein”’ fraction, a purification of ap- 
proximately twenty fold. 

The effect of the CM fraction on sensory ganglia in tissue cultures at a protein 
concentration of 0.3 ug/ml is shown in Figure 3. For comparison, the effect of the 
“protein fraction” obtained from sarcoma 180 at a concentration of 250 ug/ml is 
shown in Figure 4. The effect of the CM fraction in promoting the outgrowth of 
nerve fibers in the spinal ganglia appears to be identical with that of the tumor 
factor. In addition, as will be shown in another publication,’ the injection of the 
DEAE fraction into the yolk of the chick embryo duplicates the nerve growth- 
promoting properties of the sarcomas in the embryo. 

The venom from the two species examined, the moccasin (A. piscworus) and the 
rattlesnake (Crotalus adamanteus), were equally effective in promoting fiber out- 
growth in tissue culture. 

The question arises concerning the identity of the factors purified from sarcoma 
180 and venom. It would seem highly improbable that two chemically identical 
proteins would be found in such unrelated materials. The possibility that both 
factors have a common enzymatic activity is under investigation. 

Summary.—A heat-labile, nondialyzable factor with very potent nerve growth- 
stimulating properties has been partially purified from snake venom. The biologi- 
cal effects of the venom factor, tested in tissue culture, are very similar to the effects 


produced by a factor obtained from sarcoma 180. However, our most purified 


venom preparations have a specific activity (on a protein basis) approximately 
1,000 times as high as our purest tumor fraction. The data suggest that in both 
instances the active material is a protein or is bound to a protein. 


* This work has been supported by Grant B-463 from the National Institute of Neurological 
Diseases and Blindness of the National Institutes of Health, Public Health Service, by a grant of 
the American Cancer Society, recommended by the Committee on Growth, National Research 
Council, and by a contribution from an institutional grant of the American Cancer Society to 
Washington University. The able assistance of Miss Barbara Bankhead is gratefully acknowl- 
edged. 

1 R. Levi-Montaleini, H. Meyer, and V. Hamburger, Cancer Research, 14, 49-57, 1954. 

2S. Cohen, R. Levi-Montalcini, and V. Hamburger, these ProcEEpinGs, 40, 1014-1018, 1954. 
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6 We wish to thank Drs. A. Kornberg and H. Pahl for the diethylaminoethyl (DEAE) and 
carboxymethyl (CM) cellulose ion exchangers. 
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BACTERIAL PROTOPLASTS INDUCED BY PENICILLIN* 
By JosHua LEDERBERG 
DEPARTMENT OF GENETICS, UNIVERSITY OF WISCONSIN, MADISON, WISCONSIN 
Communicated by Sewall Wright, May 28, 1956 
Bacterial protoplasts are believed to be cellular units that have been deprived of 
their rigid cell wall.! Accordingly, they are distinguished by their spherical shape 


(in bacilliform species) and their sensitivity to cytolysis in hypotonic media. 
Several authors!~* have suggested that protoplasts might be useful material for the 


study of biosynthesis of viruses and enzymes in a system simpler or more accessible 


to external modification than the intact cell. So far, only gram-positive bacteria, 
which are susceptible to lysozyme, have been used for such studies. A method has 
now been found for the efficient production of protoplasts from enteric bacteria such 
as Escherichia coli and Salmonella typhimurium, which are already familiar physio- 
logical and genetic subjects. The technique consists essentially of the exposure of 
growing cells to a medium containing penicillin, sucrose, and Mg*+ 

Preparation of Protoplasts.—Escherichia coli strain K-12 and a variety of its 
mutant substrains are used in most of the experiments. ‘The use of penicillin and 
sucrose was suggested by the possible analogy between penicillin and lysozyme as 
lytic agents and by the finding that hypertonic sucrose would interrupt bacterioly- 
sis of Bacillus megaterium by lysozyme.! In addition, spherical bodies had been 
casually noted in other applications of penicillin,t and many authors have em- 
phasized its use in the production of L-forms.6 The following procedure was 
adopted after empirical trials and can doubtless be further improved. 

The bacteria were grown overnight in tubes with 10 ml. of broth (Difco penassay 
medium) at 37° C., ona rotator. Samples of 3 ml. of the grown culture (about 2 X 
10° cells/ml) were added directly to 10 ml. of broth supplemented with penicillin, 
1,000 u/ml, sucrose 20 per cent, and magnesium sulfate 0.2 per cent. In 2-3 
hours the cells were quantitatively converted into spheres. During this interval 
the optical density (measured at 650 my in a Coleman 14 spectrophotometer) in- 
creased about 50 per cent, but the total count (spheres or rods, estimated in a 
Petroff-Hausser chamber) remained constant. The spheres promptly lysed when 
the suspension was diluted in distilled water, and they are therefore regarded as 
“protoplasts.” 

The indicated supplements are in substantial excess, and nearly optimal yields 
of protoplasts can be obtained with 5 per cent sucrose, 100 u. penicillin, and 0.1 
per cent MgSO,. The high magnesium requirement may depend partly on binding 
with the sodium citrate used in the compounding of the penicillin preparation 
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(Squibb penicillin G, potassium, buffered). The Mg++ can be replaced by Cat+ 
but was preferred, to minimize precipitation of phosphate. In the absence of 
Mg**, protoplasts are formed but are less well preserved, and a third or less of the 
initial rods are recovered. In the absence of sucrose or comparable stabilizer, the 
cells are almost totally lysed, leaving only debris. In the absence of penicillin, the 
rods grow at nearly the normal rate, i.e., are not appreciably inhibited by 20 per 
cent sucrose, 

Protoplasts were also formed in minimal medium, but the transformation has 
been irregular and incomplete in the trials to date. Cells exposed to penicillin 
under conditions not supporting growth, e.g., in sucrose buffer, were not trans- 
formed into spheres. 

Microscopic Observations and Viability.—The transformation of rods into spheres 
was observed in small droplets each containing ten to twenty cells, immersed in an 
oil chamber.’ Each rod gave rise to a single sphere; even cells about to divide 
were inhibited in further growth and division. The progression of stages as ob- 
served with dark phase contrast at 645 is illustrated in Figure 1. A swelling 
first appears, either subterminally or (in incipient division stages) centrally, and 
progressively enlarges. The remainder of the rod then withers away and disap- 


pears. 


-—- 28 20) 


oo aKkd 


Fic. 1.-Above: Transformation of bacterial rod to spherical protoplast in the presence of peni- 
cillin. Below; reversion of protoplast to rod in absence of penicillin. Observed under dark phase 
contrast. 


The reversion of spherical protoplasts to rods, in the absence of penicillin, has 
been followed by a similar technique. The spheres first enlarge, then develop 
amoeboid, finally rhizoid, outgrowths which segment to give typical rods. Spheres 
incubated in the penicillin medium continue to enlarge but vacuolate and eventually 
lyse over an interval of 24 hours. The optical density of the suspensions shows a 
parallel cycle. The enlarging spheres are highly reminiscent of published figures of 
L-forms,® but no evidence for multiplication of spherical elements has been seen so 
far in these experiments or in trials in which 10 per cent bovine or equine serum was 
furnished. The protoplast suspensions remain intact at least for several days at 
1° C. After spontaneous lysis, or upon cytolysis in water, a residual ghost is seen 
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which retains its spherical form, although deprived of phase-dense material. The 
lysates are quite viscous, presumably from the release of desoxyribonucleic acid. 


About half the protoplasts are capable of reverting to rods and engendering 
typical bacillary colonies in the absence of penicillin. This was verified by com- 
parisons of total with viable counts on EMB lactose agar plus 20 per cent sucrose. 
The viability of the protoplasts is completely lost by cytolysis in water. On the 
other hand, normal rods are unimpaired when suspensions in sucrose solution are 
similarly diluted. 

Enzyme Synthesis and Other Observations.—Protoplasts were tested for their 
ability to synthesize 8-p-galactosidase: suspensions were sedimented, washed 
with 20 per cent sucrose, and resuspended in lactose medium (casein digest. broth 
plus sucrose plus MgSQ, plus penicillin plus 1 per cent lactose). Over an interval 
of several hours, samples were assayed with o-nitrophenyl galactoside.? In 210 
minutes the enzyme activity increased from an initial value of 0.38 to 73 units per 
10° spheres, a two-hundred-fold increase, while the optical density increased about 
twofold. (One unit corresponds to the splitting of 10~* moles of substrate per 
minute. The peak activity found in spheres from lactose-grown cells is about 150 
units. According to Rotman and Spiegelman,’ each of these units corresponds to 
5 X 10-° gm. nitrogen, or ~2 X 10” molecules of purified enzyme protein.) 
Most of the induced enzyme activity sedimented with the protoplasts but was re- 
leased to the supernate when the protoplasts were cytolysed in water. 

Protoplasts were produced in a similar fashion in F. coli strain B and in S. 
typhimurium strain TM2. The cells of the latter swam very actively, but the 
protoplasts were immotile, except for an occasional sphere on which a bacillary 
protuberance persisted (or had regenerated). Similarly, the protoplasts failed to 
agglutinate in homologous antiflagellar serum. However, electron-microscopic 
studies will be needed to establish the relationship of flagella to cell walls in this 
species. Protoplast suspensions gave a faint agglutination in homologous somatic 
antiserum. 

The protoplasts of F. coli K-12 at first appeared to be readily stainable with 
methylene blue but were later found to be very sensitive to photodynamic effects 
which result either in lysis or stainability. Boiled protoplasts retain their form 
and are evidently fixed as they become much more opaque, stain readily, and do not 
cytolyse in water. Stained protoplasts often show a denser inclusion about one- 
half their diameter, a poorly staining cortex, and a deeper-staining periphery. The 
protoplasts reduce triphenyl-tetrazolium chloride, as well as methylene blue, very 
readily. With the former, a typical formazan granule is produced, usually single 
and near the periphery of the protoplast, much as in the rods.°. No corresponding 
structure was seen in the untreated spheres. 

In view of reports on the fusion of protoplasts,’ and the possibility of DNA- 
mediated transduction, preparations have been made with a variety of appropriate 
genotypic mixtures of /. coli K-12. No evidence of hitherto unrecognized modes 
of genetic recombination has been uncovered in preliminary trials. Protoplast 
suspensions of appropriate compatibility types did, however, retain their ability to 
mate. 

Discussion.—Most of the findings in this preliminary survey warrant more in- 
tensive and exact quantitative study. The areas of the interest already expressed 





Vou. 42, 1956 BIOCHEMISTRY: J. LEDERBERG 


in bacterial protoplasts! are already beyond the scope of any one laboratory, and a 
ready method for their production in enteric bacteria may encourage further atten- 
tion. In addition to their use in the extraction of intracellular constituents by 
cytolysis and their availability for biosynthetic studies (enzymes, viruses, cell 
walls), they may also be useful in cytological, antigenic, and physiological analysis. 
Their possible genetic applications are currently being studied in this laboratory. 

The preservation of viable protoplasts in sucrose solution supports the hypothe- 
sis that the primary action of penicillin is on the synthesis or maintenance of a 
component of the cell wall,'' while the activity and synthesis of other enzymes and 
over-all protoplasmic mass are unaffected. 

The protoplasts of E. coli show some instructive differences from those of B. 
megateritum. The latter are rapidly produced by the enzymatic dissolution of the 
existing wall, and each rod produces not one but several spheres. The latter dif- 
ference presumably corresponds to the septate structure of gram-positive rods, 
each compartment containing a single protoplast.'? Gram-negative bacteria, on 
the other hand, are believed to be true multinucleate coenocytes. The most striking 
difference may vanish with technical improvements—namely, that the protoplasts 
of B. megaterium are reportedly inviable, while those of L’. coli revert to normal rods, 
an obvious advantage for genetic research. 

Summary.—Growing cells of 2. coli or S. typhimurium are quantitatively trans- 
formed into spherical “protoplasts” in the presence of penicillin, magnesium, and 
high concentrations of sucrose. The protoplasts are quickly cytolysed in water 
but revert to viable rods in the absence of penicillin. The protoplasts have not 
been observed to divide as such but have been shown to enlarge and to synthesize 
an inducible enzyme (8-p-galactosidase) in appropriate media. The possible ap- 


plications of these protoplasts in physiological and genetic research are cited. 


* Paper No. 625 of the Department of Genetics. This work has been supported by grants 
from the National Science Foundation and the National Cancer Institute, Public Health Service 
(C-2157). 
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HINDERED CIS ISOMERS OF VITAMIN A AND RETINENE: 
THE STRUCTURE OF THE NEO-b ISOMER 


By WILLIAM OrRosHNIK, Paut K. Brown, RutH HusBarpb, AND GEORGE WALD* 


ORTHO RESEARCH FOUNDATION, RARITAN, NEW JERSEY, AND BIOLOGICAL LABORATORIES 
OF HARVARD UNIVERSITY, CAMBRIDGE, MASSACHUSETTS 


Communicated July 9, 1956 


Under the above title we have recently presented the following argument:! (a) 
The configurations of the four unhindered geometrical isomers of vitamin A and 
retinene having been established (Fig. 1),? the neo-b isomer proved to be a fifth 
form, necessarily therefore possessing a hindered cis linkage. (b) Isomerization 
studies showed it to be monocis, therefore either 7- or 1l-czs. (c) An 11-cts vita- 
min A (neo-c) having been synthesized and shown not to be neo-b, it was concluded 


Neo-b neO-¢ 
| J he ¥ 
Se BNET ENG retinene 


that the latter must be 7-c7s. 


14 


Fic. 1.—Structural formula of all-trans retinene. The arrows 
point to the positions of c7s linkages in the other isomers, unhindered 
cis isomers below, hindered cis isomers above. . 


Compound II (Fig. 2), an intermediate in the commercial synthesis of vitamin A, 
was used to provide the terminal segment in our synthesis of neo-c vitamin A. 
This substance is prepared by the allylic rearrangement of compound I as a mix- 
ture of cis and trans isomers. We had used the predominant isomer, heretofore 
believed to be trans. On re-examination, however, this has now been shown to be 
cis and to have inadvertently introduced a second, terminal cis linkage into our 
synthesis. Neo-c vitamin A is therefore the 11,13-dicis isomer (Fig. 3).* 

tobeson et al.? have proposed the 11,13-dicis structure for neo-b retinene. This 
suggestion seems to be confuted by our finding that this structure actually belongs 
to neo-c, 

Using the trans isomer of compound IT, the 11-monocis isomer of vitamin A has 
now been synthesized (Fig. 3).* It is identical with neo-b vitamin A in ultraviolet 


and infrared spectra. It exhibits quantitatively the same relations in the anti- 
mony chloride reaction and on isomerization with light in the presence of iodine. 
A sample of this substance, oxidized to the corresponding aldehyde with manga- 


nese dioxide, displays quantitatively the same properties as neo-b retinene in the 
antimony chloride test and on isomerization with iodine and light.‘ Crystallized 


(neo-b retinene, 63.5°-64.4° 


’ 


from petroleum ether, it melts at 63.1°-64.4° C. 
60.8°-62.3°*). The mixed melting point of this product with neo-b retinene was 
61.5°-63.4°. A mixture of synthetic 11-cis retinene and cattle opsin, incubated 
in the dark, yields rhodopsin in the expected amount.‘ 

It was noted earlier that neo-c retinene is sterically unstable, isomerizing slowly 
even at —15° C. in the dark.! The realization that this is the 11,13-dicis isomer 
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Fig. 2.—Allylic rearrangement of compound I on 
treatment with acid yields cis and trans isomers of com- 
pound IJ. After these have been reduced to the cor- 
responding dienols (compound III), their configurations 
are established by testing with maleic anhydride. Only 
the trans isomer of compound III readily forms an adduct 
with this reagent. 
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11,13 -dicis (neo-c) 


i1-cis (neo-b) 


Fig. 3.—Structural formulas of neo-c and neo-b vitamin A. 
The 11-cis linkage is hindered in both molecules, owing to 
overlap between the 10-H and 13-CH, groups. 
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clarifies what was previously a puzzling observation. A. preparation of neo-c 
retinene, partially purified chromatographically, was concentrated in petroleum 
ether and stored at —15° C. It partly erystallized; but on examination the crys- 
tals turned out to be neo-a (13-czs). It is now clear that during storage the rela- 
tively unstable hindered 11-czs linkage had largely isomerized to trans, yielding this 
result. 

The positions of the long-wave-length absorption maxima (Ajax) of the neo-b 
and neo-c isomers are unusual. A hindered cis linkage in conjugated polyenes nor- 
mally shifts Ajax markedly toward shorter wave lengths. In neo-b vitamin A. and 
retinene, however, the displacement is only 5-7 my, the same as with an unhindered 
cis linkage. A terminal cis linkage, as in 13-cts (neo-a) or 9,13-diczs (iso-b) vita- 
min A, has heretofore been found to shift Amax 1-3 mu toward longer wave lengths, as 
compared with terminal trans configurations. The Aja, of neo-c vitamin A might 
be expected therefore to be 1-3 mu longer than that of neo-b; instead, it is 8-10 
mu shorter. The reason for these unexpected relationships is not immediately 
apparent. 

* This investigation was supported in part by grants to George Wald from the Rockefeller 
Foundation and the Office of Naval Research. 
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2 ©. D. Robeson, J. D. Cawley, L. Weisler, M. H. Stern, C. C. Eddinger, and A. J. Chechak, 
J. Am. Chem. Soc., 77, 4111, 1955; C. D. Robeson, W. P. Blum, J. M. Dieterle, J. D. Cawley 
and J. G. Baxter, J. Am. Chem. Soc., 77, 4120, 1955. 

3 W. Oroshnik, J. Am. Chem. Soc., 78, 2651, 1956. 
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ON THE SYNTHESIS OF ANTIBODY PROTEIN IN VITRO* 
By Donaxp F. SrTeINERt AND H. 8. ANKER 
DEPARTMENT OF BIOCHEMISTRY, UNIVERSITY OF CHICAGO 
Communicated by W. H. Taliaferro, July 10, 1956 


It is generally agreed that antibody formation is associated with organs contain- 
ing relatively large numbers of lymphoid cells.' Among these organs, the spleen, 
in particular, participates in antibody formation, especially when antigens are 
administered intravenously. Formation of antibody also occurs in nonimmunized, 
immature, or X-irradiated recipient animals into which spleen cells from immunized 
donors are injected. Previous observations? indicate that in the rabbit the major 
proportion of antibody protein in an anamnestic response is synthesized between 
the third and fifth days following intravenous injection of a soluble antigen. 

In vitro observations with spleen explants or slices have been restricted thus far 
chiefly to hemolysins,* * typhoid’ and paratyphoid® agglutinins, and diphtheria 
antitoxins.’ 

A method is reported here which permits the synthesis of antibody proteins by 
rabbit spleen cells on incubation for several days in vitro. In these preparations 
the cells are maintained with a minimum of degenerative changes and form pre- 


cipitable antibody against a soluble antigen in quantities comparable to the amounts 


found in in vivo transfer experiments. 
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MATERIALS AND METHODS 


Immunization Procedure.—Young adult male albino rabbits weighing 
were fed ad libitum. The rabbits received a single primary injection of 100 mg. 
of crystalline bovine serum albumin (BSA, Armour Laboratories) in saline. On 
the twelfth day blood was drawn from the ear vein and the serum titrated for anti- 
body. A wide range of serum antibody levels was noted, but only anergic animals 


were excluded. After a period of at least 1 month after the primary injection, a 


second dose of approximately 15 mg. of BSA was administered, and after the desired 
interval the spleen was removed aseptically under nembutal anesthesia. 

Analytic Procedures.—Antibody was determined by precipitation with antigen.’ 
Quantitative protein estimations on the precipitates were made by the method of 


4 


Lowry,’ using standardized bovine serum albumin as a reference. Glucose was 
determined colorimetrically. 

Air-dried smears of all cell preparations were stained with Wright’s stain, while 
wet preparations of living cells were observed directly under the microscope. In 
some experiments cell counts were made with a hemocytometer. 

Incubation Methods.—The spleen was weighed and minced and the fragments 
pushed through a No. 80 stainless-steel mesh screen with a glass pestle into a measured 
volume of medium under sterile conditions. Single-cell suspensions with only a 
few cell clumps were thus obtained. 

Eagle’s tissue culture fluid'! with increased salt concentration and 0.34 per cent 
glucose, Trowell’s medium’ with penicillin and streptomycin, and Medium No. 
199 of Morgan et al.'* were used. Varying amounts of serum were added to the 
media. 

Procedure A.—A suspension of cells was gently shaken at 37° C. in 30 ml. of 
medium in 95 per cent O.-5 per cent CO.. At the end of the incubation period the 
flask contents were centrifuged and the cells extracted after disintegration by sonic 
vibration. The extract and the supernatant medium were analyzed for antibody. 

Procedure B.—Figure 1 illustrates the incubation flask used in these experiments. 
Cellophane membranes were heated to boiling, soaked for 24 hours in distilled 
water, and stretched across the glass flange of the upper section. The flange of 
the lower section was greased with vaseline. After autoclaving, the flanges were 
fastened together by steel springs. The lower (reservoir) compartment was com- 
pletely filled with sterile medium, so that the membrane was flat and horizontal. 
Five milliliters of cell suspension were poured into the upper compartment and 
evenly distributed on top of the membrane by swirling. The cells thus settled 
uniformly on the membrane. The flasks were incubated at 37° C. with mag- 
netic stirring of the reservoir at 60 rpm; 95 per cent O.—5 per cent COs was con- 
tinuously passed over the cell suspension. About 10 minutes elapsed between 
splenectomy and the beginning of incubation. Five milliliters of the same suspen- 
sion served as a zero time control. After incubation, the fluid above the mem- 
brane was decanted and the cells adhering to the membrane examined histologically. 


RESULTS 


After very short periods cells incubated by procedure A began to show degenera- 
tive changes. The large immature lymphoid and reticular cells disintegrated, 
and only a few small lymphocytes were motile after 48 hours. Hypertonic media 
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did not prevent these degenerative changes. Glucose utilization under these con- 
ditions was much lower than in experiments where the cells remained at rest. No 
antibody was found either in the incubation medium or in the extract from dis- 


rupted cells. The results are tabulated in Experiments 1 and 2 in Table 1. 


TABLE 1 
Glucose 
Day of Period of Uptake . Antibody 
Splen- Incubation (Mg/Gm (Mg/ml) Cale. 
ectomy (Hours) * Mediumt Spleen/24 hrs.) Medium Mg/Gm Spleen 


24 oM-5°ORS 8 
18 EM-10°%RS 
72 As 
18 SM-50°RS 
18 )M-—50°,RS 
1] SM-50%RS 
} 50°°RS 
50% RS 
50% a 
50°,RS 
50% RS 
50°,RS 
50°,RS 
10°, RS 
-50°>HS 
HOC oe 
50° RS 
ih M-50°HS 
3 8 TM-50% HS 
TM only 
3 8 EM-50%RS . 6.2 


i we we) 


O10 5 
LLP: LFA? 


o.5 


* Cell counts ranged from 10? to 10* per milliliter. 
+ EM, modified Eagle's medium; TM, Trowell's medium; RS, rabbit serum or HS, hog serum in reservoir. 
t Incubation procedure A. 
§ Incubation procedure B. 
6 mg. l-C'4-glycine added to incubation medium; 250 counts/min/mg of dry BSA-antibody precipitate. 


Incubation by procedure B preserved all cell types normally seen in the spleen 
(Fig. 2). In particular, reticular cells, large lymphoid cells, and myeloid cells with 
granules were present (Fig. 3) for as long as 72 hours. Large numbers of motile 
lymphocytes were seen in wet preparations. However, after incubation for 5 
days without change of medium, many large mononuclear cells (polyblasts) and 
large multinucleated giant cells appeared (Fig. 4). No formation of fibroblasts 
occurred, and only few mitoses were seen. 

Antibody was formed when cells removed three days following the secondary 
challenge were incubated for 48 hours (Expt. 2). From 0.18 to 1.46 mg. of anti- 
body per milliliter of medium was found (i.e., 1.8-15 mg. per gram of spleen). 
No antibody could be detected either in the supernatant from the controls or in 
the extracts of the cells either before or after incubation. The antibody concen- 
tration in the medium continued to increase for 48 hours (Expt. 3) but remained 
constant after that (Expt. 4). Glucose consumption was higher than in procedure 
A (Expt. 5). Less antibody was found in experiments in which the spleen was 
removed two days following administration of the antigen (Expts. 6, 7, and 8), and 
no antibody was found either if the spleen was removed on the first day or if antigen 
was added in vitro. 

Maximal formation of antibody was found when modified Eagle’s or Trowell’s 
medium plus 50 per cent serum was used. Rabbit serum was added to the upper 
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Fic. 1.—Incubation flask used in procedure B 


Fic. 2.—Thick air-dried smear of rabbit spleen cell suspension 
before incubation. Numerous reticulum cells and small cells of the 
lymphoid series. Wright stain, «450. 
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section, and rabbit, human, or hog serum (Expt. 9) to the reservoir. Modified 
Eagle’s medium alone was insufficient. In Trowell’s medium, which contains 
higher concentrations of all the natural amino acids, approximately 75 per cent of 


Fig. 3.—Thick air-dried smear from the membrane surface after 
48 hours’ incubation without change of medium. Numerous large 
reticulum cells; abundant smaller lymphoid cells. Wright stain, 
x 450. 


Fic. 4.—Thick air-dried smear from the membrane surface after 
6 days of incubation without change of medium. Multinucleated 
giant cell (a) and numerous large polyblasts (6). Wright stain, 
xX 450. 


the maximal antibody production was observed, since the cells apparently survived 
for a sufficiently long period (Expt. 10). However, gross degenerative changes were 
evident histologically after 30 hours in all media in the absence of serum. 

In Experiment 11, 6 mg. of /-C'-glycine (3 wC/mM) was added. After incu- 
bation, equivalent amounts of egg albumin and anti-egg albumin antibody were 
added, the precipitate removed, and the anti-BSA. antibody isolated by precipita- 
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tion with BSA. Both precipitates were dialyzed against a solution of unlabeled 


glycine. No C' was found in the egg albumin precipitate. The isotope concen- 
tration of the BSA precipitate was about one-half that calculated assuming that all 
the glycine residues of the antibody were derived from the added labeled glycine. 


DISCUSSION 


It was found that incubation of surviving tissues by shaking in a suitable gas 
phase is unsatisfactory for prolonged survival of spleen cell suspensions, apparently 
because of mechanical damage of the cells by the agitation. Since antibody was 
not found, it appears that intact cells are required for the synthesis. 

To avoid damage by agitation, spleen cells were incubated at rest. To overcome 
the limitations of gas diffusion in a large volume of medium, the cells were supported 
in a thin layer by a semipermeable membrane. The layer of fluid above the cells 
was only about 1 mm. thick. The large volume of medium in contact with the 
underside of the membrane served as a reservoir of nutrient materials and permitted 
the removal of metabolic products. Exchange of materials was enhanced by stir- 
ring. On incubation of spleen cell suspensions by this method, cell degeneration 
was minimized and antibody was formed in such amounts that it could be precipi- 
tated directly. Since the antibody protein could not diffuse through the membrane 
into the reservoir, it was found in all experiments only in the medium in which the 
cells were suspended. 

Antibody formation was considerably lower if the spleen was removed 2 days 
after a secondary antigen injection. When the spleen was removed after 24 hours, 
no antibody formation could be detected, although all the cells were histologically 
fairly well preserved for 4 days. The formation of polyblasts and multinucleated 
giant cells after 5 days of incubation in vitro indicates that for this period of time 
the medium presently used does not sustain the cells in the same histological state 
as is maintained in the intact organ in vivo. The inadequacy of the culture medium 
probably accounts in part for lack of antibody formation when the spleen is re- 
moved prior to the second day after antigen administration. Similar effects had 
been reported previously* and attributed to a deficiency in the culture system. 

Even though serum is not absolutely essential for the synthesis of antibody pro- 
tein, extracellular protein appears to contribute to the preservation of the cells. 
It has likewise been noted that serum is necessary for maintenance of growth in 


‘t Tt cannot be decided at the present time whether the serum 


many cell cultures. 
effect is due to the protein itself or to low-molecular-weight factors associated with 
it. 

Taliaferro and Talmadge" have studied the anamnestic response of rabbit spleen 
cells removed 3 days following a secondary challenge with BSA after transfer to 
nonimmunized recipients. In these experiments a total of from 4.5 to 50 mg. of 
antibody protein per spleen was synthesized following transfer of one spleen to 
one or two animals after a challenging dose of 60-90 mg. of antigen, or about 4-5 
times the amounts used here. Although the magnitude of the antibody response 
as a function of the antigenic stimulus is not known, the amount of antibody pro- 
tein synthesized in vitro in the experiments reported here ranged from 3 to 37 mg. 
per spleen. The yield in previous in vitro experiments’ was only about one-tenth 
of the amount of antibody obtained in equivalent transfer experiments in vivo. 
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In all previous in vitro experiments’ antibody formation could be demon- 
strated only because of the high potency of the antigens used and because of the 
very sensitive immunological methods available for the detection of antibodies 
against particulate antigens and toxins. 

The method of incubation described here would seem to hold promise for other 
experiments requiring the maintenance of differentiated cells in a functional state 


in vitro. 
SUMMARY 


A technique has been developed which permits incubation at rest of large quanti- 
ties of rabbit spleen cells in vitro. Under these conditions net synthesis of pre- 
cipitating antibody against bovine serum albumin occurs. The amount of anti- 
body formed corresponds closely to the amount obtained after transfer of spleen 
cells to nonimmunized recipients in vivo. It seems that the success of this tech- 
nique is due to the prevention of damage by agitation and to provisions for ade- 
quate nutrition and aerobic metabolism of the cells. 


* Supported by a grant from the United States Atomic Energy Commission. 

t In partial fulfilment of the requirements for the M.S. degree. 
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PRODUCTION OF PROTOPLASTS OF ESCHERICHIA COLI 
BY LYSOZYME TREATMENT 
By Norton D. Zinper and WILLIAM F. ARNDT 

ROCKEFELLER INSTITUTE FOR MEDICAL RESEARCH 

Communicated by René J. Dutos, July 6, 1956 
Recently, investigators of some biochemical problems have made use of sub- 
cellular elements of bacteria in an attempt to study systems more highly integrated 
than soluble extracts but more amenable to external manipulation than intact 
cells. Cells which either have been sonically disrupted! or have had their cell walls 
removed (protoplasts)? have been used. Protoplasts have hitherto been prepared 


only from gram-positive organisms. Treatment of Bacillus megaterium with 


lysozyme in hypertonic media brings about a change from its normal rodlike form 
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to a spherical one and results in its susceptibility to lysis upon suspension in hypo- 
tonic media.* These altered properties are presumed to result from dissolution of 
the cell wall, the cytoplasmic contents then being retained by the cell membrane. 
Unfortunately, little is known of the genetics of the gram-positive organisms, and 
the fruitful concomitant biochemical and genetic attack is thus restricted. For this 
reason it would be of obvious advantage to obtain protoplasts of such organisms as 
Escherichia coli, which has been the subject of detailed genetic and physiological 
investigation. This report details the development of such a technique which also 
involves the use of lysozyme. 

Our thinking on the problem of obtaining protoplasts of 2. coli was influenced by 
the report of Hirsch‘ that some material isolated by him from rabbit leukocytes 
(phagocytin) would kill #. coli strains. Phagocytin had chemical properties similar 
to, but not identical with, those of lysozyme. When EL. coli cells were treated with 
crude phagocytin, (kindly provided by Dr. Hirsch) in hypertonic media at pH 5, 
they became susceptible to lysis upon dilution into distilled water. However, 
only a rare cell was morphologically identifiable as a protoplast. The difficulty in 
obtaining phagocytin in sufficient quantity for detailed investigation caused us to 
seek another source of active material. On the hypothesis that phagocytin was a 
lysozyme of different specificity, hen egg albumen, a rich source of lysozyme, was 
tested. Treatment of cells with a one-to-four dilution of fresh albumen resulted 
in the rapid conversion of cells to protoplasts at pH 5 and 9. At pH 5 the spheres 
were badly agglutinated. All further experiments, unless otherwise noted, were 
therefore done at pH 9. No activity of albumen was detected at pH 6-8. Crude 
fractionation of egg albumen revealed that the activity was associated with the 
lysozyme fraction. Commercial lysozyme (Armour Chemical Company) was 
equally effective. 

CONVERSION OF CELLS TO PROTOPLASTS 

The following procedures for the efficient conversion of cells of 2. coli strains 
K12 and B, and their derivatives, were empirically developed. Overnight broth 
cultures of cells are diluted tenfold in Difeo Penassay broth and incubated with 
aeration for 90 minutes. This takes the culture just out of logarithmic growth 
phase (cells less fragile) and gives a titer of 4-5 X 10° cells per milliliter. The cul- 
ture is washed once in saline by centrifugation and resuspended in saline. After 
3 minutes of incubation at 37° C., sufficient tris buffer (Sigma Chemical Company) 
and lysozyme are added to make the final concentrations 1 M and 200 ug/ml., 
respectively. Further incubation for 5 minutes converts the cells to protoplasts 
(Fig. 1). Other supporting media that can be used are sucrose 0.6 M with 0.1 M 
carbonate buffer, pH 9, or M/2 NaCl with carbonate buffer, or 0.3 M NH,Cl 
brought to pH 9 with NaOH. In these latter two media the cells will cytolyze 


upon dilution into hypotonic media but do not assume a spherical shape. Centrif- 


ugation of the preprotoplasts and resuspension in 0.6 .WZ sucrose will cause their 
immediate swelling into spheres. 

With regard to these procedures, some points need be mentioned: (1) At pH 
8-8.8, protoplasts are formed but are badly agglutinated. This agglutination can 
be prevented by the addition of 0.1 per cent Mg*+*; however, some 10 per cent of 
the cells retain their rodlike shape and are not subject to cytolysis. (2) It is requi- 





588 BIOCHEMISTRY: ZINDER AND ARNDT Proc. N. A. S. 


site that the cells be washed and starved slightly for quantitative conversion to 
occur. Perhaps they can heal the lysozyme-induced lesions if they are able to 
grow. (3) Certain strains will not form spheres in the treatment medium but will 
do so when resuspended in other media. In order for the cells to form spheres, 
they apparently must imbibe some water and swell—in a sense, hatch out of their 
cell walls. As yet there is no rationale for the preferential use of any medium, 
other than its empirical success with the strain in question. 


PROPERTIES OF PROTOPLASTS OF £. col7 


1. General.—Growing cells of F. coli are generally in a doublet condition. Each 
doublet gives rise to two protoplasts, with separation occurring at the point of in- 
cipient division. Motile strains lose their motility when converted to protoplasts. 
The spheres do not detectably adsorb bacteriophage. The protoplasts readily re- 
duce methylene blue. 

When protoplasts are suspended in broth media, they lyse no matter how high 
the osmotic pressure of the medium and whether it is maintained by sucrose, NaCl, 
or dextran. Addition, to a final concentration of 2 per cent, of bovine serum 
albumin prevents their lysis and makes further study possible. The following 
medium (P-broth) was used to study the growth of bacteriophage by, and the 
viability of, the protoplasts: Difco Penassay broth was supplemented with 0.6 M 
sucrose, 2 per cent serum albumin, and 0.1 per cent Mg** (this prevents agglu- 
tination at the lower pH of this medium and may be required for growth proc- 
esses). 

2. Growth of Bacteriophage.—Protoplasts of FE. coli B will grow bacteriophage 
T4. Cells were grown as described, and suspended, after washing, in saline con- 
taining 100 ug/ml. l-tryptophan (necessary for phage adsorption). Phage was 
added so as to give one particle per hundred bacteria, and after 5 minutes of incu- 
bation for adsorption and injection the cells were converted to protoplasts in tris 
buffer. Infective center titers (plaques) were obtained for protoplasts, lysed pro- 
toplasts, and cells by plating in M/2 NaCl phage agar (8 per cent tryptone, 0.1 per 
cent glucose, and agar) with an excess of 2. coli B. Suitable samples were incubated 
in P-broth to determine final yields. Table 1 shows the results of such an experi- 

TABLE 1 
GROWTH OF BACTERIOPHAGE T4 IN PROTOPLASTS 
AND CELLS oF E. coli B 


(Number of Plaques per Plate, Had Equivalent 
Volumes of Equivalent Dilutions 
Been Plated) 
INCUBATION INCUBATION 
TIME TIME 
0 Min. 90 Min. 0 Min. 90 Min. 
Cells 350 21,100 Lysed protoplasts 3 12 
Protoplasts 83 15,500 


ment. About 20 per cent of protoplast infective centers are recovered directly. 
Only 1 per cent of infective centers are recovered from lysed protoplasts. These 


probably represent unadsorbed and desorbed phage, as there is little rise in the 
titer with time. Protoplasts give almost as large a yield as the untreated cells, 
indicating that the initial loss of protoplast infective centers is due to their prema- 
ture lysis on the plates rather than to inability to grow phage. 
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3. Viability—For 4 hours after suspension and incubation in P-broth, pro- 
toplasts exhibit no gross morphological changes as observed by phase-contrast 
microscopy. At this time, the majority of the protoplasts start to shrink, become 
highly refractile at their periphery, and assume a coccobacillus form. Their fate 
is still uncertain, but they probably represent a degenerate state. On the other 
hand, about 10 per cent of the protoplasts increase in volume and form a large 
vacuole. From the side distal to the vacuole, finger-like processes appear which 
grow into rods (Fig. 2). The mother protoplast becomes extremely distorted, 


ix @ 
X.Y e 


* 


Fig. 1.—Phase-contrast microphoto- 
graph of wet mount of protoplasts. 
Note nonswollen rod for size compari- Fic. 2.—Phase-contrast microphoto- 
son between the rods and_ spheres. graph of wet mount of protoplast 
X 1,700. budding off a rod. 1,700. 


and a wide variety of morphological abberations are observed, reminiscent of the 
growth of L-forms from large bodies.’ Direct demonstration of the viability of the 
protoplasts has been obtained by plating them in P-broth to which sufficient agar 
to just solidify the mixture was added. The equivalent of from 2 to 20 per cent 
(different experiments) of the treated cells formed normal colonies. On no other 
solid media have the protoplasts returned to the actual bacillary form. 
DISCUSSION 

Although lysozyme has been claimed to be without effect on EF. coli cells, it was 
known to be bound by them.® In our studies it was clear that at pH 6-8 lysozyme 
was without effect, whereas its activity was extremely rapid at pH 8 and above. 
It is interesting to note that suspension of F. coli in the alkaline treatment medium, 
in the absence of lysozyme, brings about, within 30 minutes, the slow lysis of the 
cells upon their suspension in hypotonic media. This action may be analogous to, 
though less efficient than, that of lysozyme, which is itself a basic protein. Certain 
classes of bacteriolytic agents may have as their primary site of activity the cell-wall 
substance, with the lysis of the cell being due to secondary osmotic effects following 
the dissolution of the cell wall. In the course of these investigations it was noted 
that the lysis of EF. coli by phage ghosts (phage membranes obtained by osmotic 
shock) went through a protoplast stage. However, no suitable stabilizing medium 
for such protoplasts was found. Thus three biological reagents—phagocytin, 
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lysozyme, and phage ghosts—are available for efficient lysis of 2. cold cells for 
preparation of bacterial enzymes, nucleic acids, cell membranes, etc. 

The protoplasts of #. coli have retained—as had their predecessors, the proto- 
plasts of B. megaterium’—sufhicient of their synthetic capacities to synthesize bac- 
teriophage. This may be taken as a general indication of their biochemical in- 
tegrity. The difference in the viability of 2. coli and B. megaterium protoplasts 
may be only a reflection of the different media used in the different studies. As 
pointed out, there has been considerable variation in the viability of the FE. coli 
protoplasts. This may depend on the amount of residual cell-wall substance left 
on the protoplast and/or the soluble materials lost during the conversion process. 

With the ready accessibility of protoplasts of F. coli, investigation of their genetic 
potentialities is under way. 

SUMMARY 

The cells of two strains of £. coli can be quantitatively converted into spherical 
protoplasts by the action of lysozyme at pH 9. The protoplasts are capable of all 
the biosyntheses necessary for phage growth. About 10 per cent of them are also 
capable of reverting to normal rod forms when incubated in an appropriate medium. 
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ION ASSOCIATION IN AQUEOUS SOLUTIONS OF 1-1 
ELECTROLYTES AT 25° 
By Mario AzzArRI* AND CHARLES A. Kraus 
METCALF RESEARCH LABORATORY, BROWN UNIVERSITY 
Communicated June 22, 1956 
According to Onsager’s theory, the conductance of an unassociated electrolyte 


in aqueous solution decreases with concentration in accordance with the equation 


A = Ao “+ (aAg + B/C. (1) 


It has been found, however, that, while the conductance approaches the theoretical 
value in this solvent as C approaches zero, the observed values of A at higher con- 


centrations are somewhat higher than those predicted by theory. The deviations 
from theoretical values increase with concentration approximately as a linear func- 
tion of C. We thus have Shedlovsky’s empirical extension! of Onsager’s equation. 


A = Ao — (Ana + B)YC + BC, (2) 


where B is an empirical constant. 
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While conductance values of inorganic salts quite generally conform to equation 
(2) with a positive value of B, this is not the case with many quaternary ammonium 
salts.” It has been suggested that the observed negative deviations from Onsager’s 
equation are due to ion association. However, up to now, the correctness of this 
hypothesis has not been established. It is our purpose, in this paper, to determine 
whether or not ion association actually occurs in aqueous solutions. 

Before proceeding to an examination of available data, we need to consider the 
equilibrium in a dilute electrolyte solution in which ion association occurs to a very 
small extent. 

If ion association occurs, the conductance is given by the equation 


A = Ao — (akg + B)VC + BC + A,A, (3) 


where A,A is the conductance decrease due to ion association; thus it is always 
negative. If AoA is the deviation from the simple Onsager equation, A,A is given 
by the equation 
AVA = AoA = BC. 
The value of AoA is approximated fairly closely by the equation 
AoA = A’ = Apo, 
where 
A+ BYC 


A’ ; 6 
l1—avVC (6) 


Up to 0.01 N, the values of AjA may be quite closely approximated by means of 
equations (5) and (6). AoA is usually, but not always, negative if ion association 


occurs. 
The degree of association, 1 — y, is given by the equation 
— A,A —(AyA — BC) 


b eg = 7) 
¥ A = Aa A A ain AoA + BC ( 


where y is the degree of dissociation of the electrolyte. If the equilibrium between 
ions and ion pairs conforms to the law of mass action, we have 


lL (A — AA + BC)Cy?f? (8) 
ae BO — AA 


where K, is the association constant of the electrolyte. If A, is known, the value 
of A,A is given by the equation 
: AK, Cy2f? 
—A,A = —(AoA — BC) = ao (9) 
1 — K,Cy?f? 
In the limit, the value of the activity coefficient, f, is given by the Debye-Hiickel 
equation 


— log f = BVC (10) 


At higher concentrations, it is more precisely given by the equation 
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BYC 
°6. 1 + xa 


where x = 0.3288C X 10‘ and ais the sum of the radii of the ions.* For an aqueous 
solution at 0.01 N the value of f?, calculated for a = 5 A, is approximately 3 per 
cent smaller than that calculated on the basis of equation (10). 

The value of f may be approximated fairly closely on the assumption that y 
equals unity. The value of B, on the other hand, is best approximated by the trial- 
and-error method. It may, of course, be calculated by equating the expression for 
1/K, as given by equation (8) for two different concentrations and solving for B, 
but, if the data for either concentration are in error, the values of B will also be in 
error. The latter method may at times be employed in a rough approximation. 

For many salts the value of B is small and values of AK, may be approximated 
fairly closely on the assumption that B = 0. The values of K, so found will vary 
only by a few per cent at higher concentrations. If B is large, the value of K, so 
found will decrease greatly with increasing concentration. 

In any case, values of A, must be calculated for different values of B, and that 
value of B is the most probable which yields a K, which reproduces experimental 
AoA values most nearly. 

Conductance data are available for ten electrolytes that exhibit negative devia- 
tions from the simple Onsager equation. The limiting slopes of the AjpA versus C 
plots range from —32 for MesNBr to —286 for BusNI.? For two of these salts 
data are available up to 0.01 N; for the remainder the measurements were not 
carried above 2-3 X 10-* N, but the deviations of four salts are sufficiently large 


to permit a fairly satisfactory analysis. For the remaining four salts the deviations 


are too small to permit a satisfactory determination of association constants. 

In Table 1, Sections A-F, are presented the results of our calculations. Values 
of Ao, B, and K, employed in the calculations are given at the head of each subtable. 
Concentrations are given in the first column, experimental values of ApA, the de- 
viations from Onsager’s equation, in the second column, and the differences between 
calculated and experimental values of AoA in the third column. Underneath the 
last column appear the average deviations of the calculated from the experimental 
values of AoA for each salt. 

Three parameters are involved in the calculations: A», the ion size a, and the 
coefficient B of the Shedlovsky equation. For five of the salts Ag was adjusted 
slightly upward by, at most, 0.02 A-unit. For one salt, Pr,NI, it was decreased 
0.093 A-unit. In the original evaluation of Ao for this salt, extrapolation was car- 
ried out from 7.5 X 10-4 N. In dilute solutions, K, is highly sensitive to errors in 
Ao. 

In the present calculations we have assumed a = 5 A for all salts with the excep- 
tion of Me,NI, where we assume a = 3 A. The calculated value of K, is not very 
sensitive to the value assumed for a. For example, in the case of BuyNI, the value 
of f? calculated on the basis of a = 5 A differs from that calculated by means of 
equation (10) by only 3 per cent at C = 0.01 N. Thus the error introduced into 
the value calculated from K, is only 3 per cent. 

With the exception of Me,NI, the value of B is small for all salts. For these 
salts one obtains a rather close approximation to the measured values of ApA by 
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assuming B = 0. In the case of Me,NI, however, values of ApA calculated on this 
basis differ widely from the experimental ones. 

It may be noted that the K, values employed in these calculations are in all cases 
those obtained for the highest concentrations for which measurements have been 
reported. 


TABLE 1 


COMPARISON OF CALCULATED AND EXPERIMENTAL V 
OF AjA FOR DIFFERENT ELECTROLYTES 

AoAexp AoAexp 

AoAexp AoAcale Cc xX 108 AoAexp AoAcale 

A. BusNI* MeNIt 

Ao = 96.291; = 25; K 3.869 ’ 2 ; B= 235; Ke 3.153 
13809 ; +0. O14 16496 0.019 —0.001 
25416 at 000 2.95012 040 + 002 
§. 52156 o2 O10 5.39209 067 + 002 
0437 5 t+ O02 9 26201 096 — .009 
O819 ¢ 000 5.1065 164 + 007 
6093 ¢ 002 23 . 8422 229 + .006 
735 358 0.000 38.3611 305 — .002 
0.004 17.2754 412 + 006 
1991 0.439 0.000 
a.d. = 0.004 


AoAexp AoAexp — 
AoAexp AoAcale a AoAexp AoAcale 
C. PraNIt D. i-AmsNNOs* 

Ao = 100.287; B = 0; Ka 2.387 Ao 89.282; B= 0; Ka = 1.995 
7.51895 0.164 0.000 83507 0.045 —0.003 
6598 234 + 005 5.68994 095 + .001 
7.6650 369 + (02 4082 185 + .003 
4199 0.603 0.000 22.8073 347 000 
a.d. 0.002 37.6061 0.548 0.000 
ad. = 0.002 


E. n-BusaNNOst . F. BusNBrt 

Ao = 90.900; B = 20; Ka = 1.723 Ao = 97.470; B = 0; Ka = 1.342 
5.24798 0.067 +0.001 36593 0.012 —0.005 

76340 095 000 70109 038 + .004 
11.1045 138 + 004 5.39813 065 — .001 
16. 1205 189 — .001 7507 124 — .003 
23.1355 265 + 001 4209 0.242 0.000 
35.8263 0.391 0.000 ad. = 0.003 

ad. = 0.002 


2. W. Martel ana C. A. Kraus, these Procerpinas, 41, 9, 1955. 
L. Swarts and C. A. Kraus, these Procerpinas, 40, 382, 1954. 


*} 
tt 
TI 


2. G. Baker, thesis, Brown University, 1951 

The only test we have for the possible validity of the assumptions underlying 
these calculations in the closeness with which the calculated values of AjA agree 
with those found experimentally. Since the value of f is only slightly affected by 
the value assumed for the parameter a, there are only two major assumptions. 
These are (1) that the ions of the electrolyte interact to form a neutral molecular 
species whose molecules are in equilibrium with the free ions in accord with the law 
of mass action and (2) that any deviations of the conductance of the free ions from 
Onsager’s simple equation are positive and substantially linear up to the highest 
concentrations measured. As may be seen from Table 1, section A, the calculated 
and experimental values of AjA for BuyNI are in rather good agreement. The 
average difference between the two is only +0.004 A-unit. For this average, two 
values have been included in the dilute range, where the differences are +0.014 
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and —0.010. Considering the numerous different sources of error inherent in the 
measurements of the conductance of electrolyte solutions, the agreement is remark- 
ably good. 

As may be seen from section B of Table 1, the agreement for Me,NI is quite as 
good as or better than that for BuyNI. The concentration range is from 1.46 X 
10-4 to 96.2 & 10~4, a 60-fold concentration increase. The average difference be- 
tween calculated and experimental values of AA is 0.004 A-unit, or slightly less 
than 0.004 per cent. The maximum difference is 0.009 A-unit. For the two salts 
for which we have data up to 0.01 N, the agreement is clearly within the limit of 
experimental error. 

In Figure 1 are shown plots of AjyA and A,A versus C for MesNI and Bu,NI. Here 
the plots for AjA represent the negative deviations from Onsager’s equation as 
measured. The continuous line represents calculated values; the experimental 
values are represented by circles; the radii of the circles correspond to 0.05 A-unit. 

The plots of A,A represent 





the conductance decrease 
due toion association. The 
degree of association, | — 
vy, 1s equal to —A,A/(A — 
A,A). Obviously, a plot of 
| — y versus C would par- 
allel the plot of A,A versus 
C fairly closely. 

The values of K, for the 
two salts, 3.153 (Me NI) and 
3.869 (BusNI), do not differ 
greatly; the two salts are 





associated to much the same 








degree, but the values of 
20 40 , 60 . 80 100 AoA differ greatly. I his is 
C x 10% due to the fact that for the 





Fig. 1.—A,A and AoA as functions of concentration. first salt B = 235 and for the 
second salt B = 25. 

It was possible to determine B fairly closely for BusNI, despite its small value. 
Assuming B = 35, the difference between calculated and experimental values of 
A,A for four different concentrations, beginning with the highest, were respectively 
0.000, —0.006, —0.002, and 0.000. As shown in Table 1, section A, assuming B 
= 25, the differences for the same concentrations are respectively 0.000, —0.002, 
0.000, and +0.002. At high concentrations, a difference of 0.006 exceeds the ex- 
perimental error. The values of AjA as calculated for B = 35 are too large because 
the value assumed for B is too large. 

At low concentrations the differences in the calculated values of ApA on changing 
B from 25 to 35 are too small to be significant. It is only by means of data at 
concentrations above C = 17.85 & 10~4 that we are able to evaluate B with any 
degree of assurance. It is obvious, therefore, that, for the four salts for which data 
are available up to only about 3 xX 10~* N, it will not be possible to evaluate B 
successfully. 
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With BusNNOs we get a good agreement using B 20, but with AmyNNO; 
we get an equally good agreement with B = 0. It would appear that, for the four 
salts here in question, B is small, probably less than 25. <A precise evaluation 
of B is not possible on the basis of existing data. 

The plots of A,A for MesNI and Bu,NI as shown in Figure 1, differ very little, 
for the A, values are not greatly different and A» is markedly the greater for MesN1. 
The plots of ApA, however, differ greatly. For BuyNI the plot of AoA differs little 
from that of A,A, since B has a value of only 25. For Me,NI, on the other hand, 
B is equal to 235 and AA is very small. The differences between the ordinates of 
the A,A and A;A plots are equal to the values of BC. 

For five of the salts the value of B is very small, not greater than 25. For Me NI 
the value of B is surprisingly large. Our knowledge of B values for different 
electrolytes is very limited. For KCl, KBr, and KI, Owen and Zeldes* report the 
following values at 25°: 145.0, 91.8, and 121.1, respectively. At 55°, for the same 
salts, they report 177.3, 184.0, and 267, respectively. Swarts and Kraus report 
B values as shown in Table 2. The AjA versus C plots are all closely linear up to 
about 3 X 10~* N, the highest concentration measured. It may be noted in this 
connection that the deviations reported by Owen and Zeldes are all closely linear, 
with the exception of KCl at 25°. It seems very likely that the deviations of the 
salts that we have examined in this paper are nearly linear, sufficiently so that any 
departure from linearity would not affect our results significantly. 


TABLE 2 
VALUES OF B ror DIFFERENT SALTS 
IN WATER AT 25° 

Salt B Salt B 

Nal 133 MeN BrO; 67 

NaPi 166 Bu,N BrO; 93 

MeNPi 128 Me,NCl 16 

EtyN Pi 100 MeyNNO; 14 
As Swarts and Kraus have shown, the negative deviations of quaternary am- 
monium salts decrease as we go from large to small positive ions; so, also, they de- 
crease in going from iodide to chloride ions. It is of some interest to note that, in 
liquid HCN, where all deviations are positive, the positive deviations increase in 

the same order in which the negative deviations decrease in water. 

The values of K, which we have calculated are not in accord with Bjerrum’s 


theory. According to this theory, association should increase as the size of the 


ions decreases. As our results show, the contrary is true; ion association decreases 
as the ions are smaller. This is clearly shown by the K, values of Table 1. Most 
striking is the decrease in going from iodide to bromide ions; the association con- 
stant for BusNBr is approximately one-third that of BusNI. In this connection, 
it may be noted that for MesNCl, B = +16 and the AjA versus C plot is strictly 
linear up to 33 X 10-4 N, the highest concentration measured. Thus it is highly 
probable that Me,NCl is unassociated in water. 

Bjerrum’s equation, which relates the association constant of the ion-pair equi- 
librium with the dielectric constant and the temperature, contains a parameter a. 
This parameter may be interpreted as the distance between the centers of charge of 
the ions when in contact. For many electrolytes in solvents of lower dielectric 
constant for which K values have been determined on the basis of conductance 
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measurements, the parameter @ has a value ranging from 4 to 8 A and increasing 
as the ions are larger. In other words, the A values are in reasonably good accord 
with Bjerrum’s theory. In water, however, the a values derived from the K, 
values of Table | are not in accord with this theory. For BuyNI the value found 
for a, calculated on the basis of A, = 3.87, is only 1.02 A. Clearly, in this instance, 
the parameter a no longer has physical meaning. For BusNBr, the value of a is 
found to be 1.93 A. In other words, the parameter a for the bromide is nearly 
twice that of the iodide. 

We are presently carrying out an analysis of conductance data for solutions in 
dioxane-water mixtures. We shall report the results in a later communication. 


* Fulbright scholar from the University of Rome. Postdoctoral fellow in chemistry at Brown 
University. 

1'T. Shedlovsky, J. Am. Chem. Soc., 54, 1905, 1932. 

2 E. L. Swarts and C. A. Kraus, these PRocEEDINGs, 40, 382, 1954. 

*H. 8S. Harned and B. B. Owen, The Physical Chemistry of Electrolytic Solutions (New York: 
Reinhold Publishing Corp., 1950), pp. 32-39, 120. 

‘B. B. Owen and H. Zeldes, J. Chem. Phys., 18, 1083, 1950. 


THE OPTICAL ROTATORY DISPERSION OF 
SIMPLE POLYPEPTIDES. I 


By Wituiam Morrirr and JEN Tst YANG 
DEPARTMENT OF CHEMISTRY, HARVARD UNIVERSITY, CAMBRIDGE, MASSACHUSETTS 
Communicated by G. B. Kistiakowsky, June 4, 1956 


In a recent communication! the optical rotatory powers of helical macromolecules 
were investigated. It was shown that, under favorable circumstances, an abnormal 
frequency dependence should lead to marked departures from the simple Drude 
equation. 

The purpose of the present note is twofold. In the first place, we shall propose 
an alternative phenomenological equation to describe the more complex dispersion 
that is sometimes observed for helices. The equation involves three constants, two 
of which should be more or less invariant properties of the helical skeleton itself; 
the latter should be independent of temperature, solvent, and the nature of the side 
chains for a given skeletal conformation. We shall then describe the observed dis- 
persion for poly-y-benzyl-L-glutamate and for poly-a-t-glutamic acid, a preliminary 
account of which has been published elsewhere.” * There is strong experimental 
evidence that, under the right conditions, both polymers exist in the a-helical con- 
figuration ;* ® it also seems clear that the screw senses of the helices are well defined, 
though whether these are right- or left-handed for given side chains remains to be 


established. The rotatory dispersion curves are anomalous in both cases, as had 


been predicted. Their more detailed analysis is also in satisfactory accord with the 
phenomenological equation proposed as a result of the theory. 

In a forthcoming note,® the molecular theory is applied in greater detail to the a- 
helix. On the basis of dichroic studies of myristamide by Peterson and Simpson,’ 
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it is suggested that the experimental evidence presented here is consistent with the 
view that the screw sense of both helices is right-handed. 
ANALYSIS OF THE DISPERSION 

It is profitable, at first, to consider why so many systems which are transparent 
through the near ultraviolet should show the simple dispersive behavior of a single- 
term Drude equation. In order to facilitate comparisons with the formulas of 
molecular optics, we shall begin by defining the most convenient quantities with 
which to work. 

Instead of the specific rotation [a], it is preferable to use what may be called 
“effective molecular rotations’’ [m’] defined by the relations 


is a fal|M 
[m| = ; (1) 
2 2,° + 2 100 


Here n, is the refractive index of the medium, the dispersion of its polarizability for 
light of different frequencies v being thereby eliminated. / is the molecular weight 
of the optically active solute species; for polymers it is the molecular weight per 
residue, when [m’] is called the “effective residue’’ or monomer rotation. 

If the asymmetric molecules are in their ground states and interact neither with 
themselves nor with the solvent, then the frequency dependence of the effective 
molecular rotation is given by the quantal formula’ 


a,v" 967N 5 


[m’] = > — ~ F Tt, (2) 
ic 


i | flan A sae 
where r; is a molecular parameter (the rotational strength) associated with the 7th 
absorption band, which occurs at frequency »; in the electronic spectrum. Under 
these conditions, the identity® 


>a, = 0 = Dr, (3) 


1 


holds. However, since measurements are almost always conducted under less 
idealized circumstances, equation (2) may be used only if it is acknowledged that 
the rotational strengths r;, of the various absorption bands are functions not only 
of the solute molecules but also of the temperature and of the solvent. More- 
over, since asymmetric molecules may be expected to have asymmetric solvation 
shells, equation (2) cannot describe the dispersion through regions of incident 
light frequency where the solvent itself absorbs. 

Now suppose that we have to analyze the dispersion of simple molecular systems 
which only begin to absorb at wave lengths appreciably shorter than those for 
which the rotatory power has been measured. It will always be possible formally 
to expand the terms of equation (2) in inverse powers of (A? — Ao”), where the wave 
length Xo is as yet undetermined: 


Yaar? Vard2A? Tr Ao”) 


x : ——— ++ O[(A? — do?) ~4]. (4) 
(2 oon No?) (\? — Xo")? 


[m’] = 


If \2 > Ao?, A,2, this may be expected to converge quite rapidly. Now let Ao? be 
determined by the relations 
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Ado! = oars, Qo? = Da,’ 
t 1 


The coefficient of (A? — Ao?) ~* on the right-hand side of equation (4) vanishes. Ac- 
cordingly, if this Ao? is positive and appreciably less than \?, the dispersion is well 
described by the simple formula 


Myo" ; 
X2 — do? (6) 
This is, of course, just a slightly modified form of the single-term Drude equation. 
Since the dispersion of the factor 3/(n,? + 2) is relatively small, it is effectively iden- 
tical with this equation, whose empirical coefficients ap, A» receive their interpreta- 
tion from equations (5). The optical rotatory dispersion of a large variety of mole- 
cules, including simple amino acids, di- and tri-peptides, obey equation (6) Over : 
wide range of frequencies.° 

It is difficult to lay down useful conditions for the validity of the simple Drude 
equation. We may note that, from equation (5), the contributions of the lower ex- 
cited states are heavily weighted in determining A»”; if these states are fairly close to- 
gether, and the sum of their rotational strengths is larger than any one of these, 
then it is clear that Ao? satisfies the required inequalities, particularly since equation 
(3) will be at least approximately valid. Conversely, if the sum of the rotational 
strengths of the lower excited states is small with respect to any one, then we may 
expect the dispersion to be complex, and equation (6) will not be satisfied. It is 
just this possibility—-which must be expected whenever a principal source of the 
rotatory power is the asymmetric disposition of identical chromophores within a 
molecule—to which we attribute the anomalous dispersion of certain tartrates and, 
in particular, of simple polypeptides in a helical configuration. 

In a helix of any great length, allowed transitions, which alone contribute to the 
optical rotatory power, are polarized either parallel or perpendicularly to the screw 
axis. (By “allowed’’ we do not imply that the bands need be strong.) For each 
parallel band there is an associated perpendicular band, and conversely. More- 
over, pairs of such associated bands are rarely separated by more than 100 A or so; 
indeed, owing to their width, they will generally overlap. A single monomeric 
chromophore, like the peptide bond of a single residue, will appear in the spectrum 
of the a-helix as just such an overlapping pair of bands. More generally, this will 
be true for all monomeric chromophores which are prevented, on sterie grounds or 
by the interposition of insulating groups, from conjugative interactions with one 
another. For these important, longer-wave-length systems the rotational strength 
of the parallel component may be exceedingly large, but it is almost exactly com- 
pensated by the opposing rotational strength of its perpendicularly polarized part- 
ner. The partial effective monomer rotation due to such a band pair may therefore 
be written! 

QA 72 a,Ai7 
A? — rij? rn? — Ai? 

ar," bir; 
Mw— AZ | (AP — AP)?’ 


‘) 
Lm; | 
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since < Ne -> . \, 7], and 


Formally, we therefore adopt the modified equation 


= «kee biA,3 
m') => + (9) 


cA’ Ap 


where relation (7) has been extended over all pairs of band systems; in so doing, 
however, we recognize that b; may occasionally be very small but that for certain 
terms a, and b, are quite comparable. It is evident that, in the latter eventuality, 
the dispersion cannot be represented by a simple expression like equation (6) over 
any sizable range of wave lengths, except in very special circumstances. 

A phenomenological form for the anticipated dispersion is obtained in much the 
same Way as equation (6) is derived from equations (2). It takes the form 


Ayo” boAo' 
obs 


(A? — Xo?) (X2 — Ag?)” 


al 
| 
J 


[m (10) 


where the constants do, bo, Xo are chosen so that 


ApAo” Ya Ai’, 


boAo* = DV lard2(r? — No”) + BA,'], 


[a rA2(A2 — Ao2)? + 2A M(AZ — Ao?)] = O. 


It will be valid whenever Ao”, so determined, is positive and considerably less than 
\*. We note that the last relation involves the sixth powers of the wave lengths. 
In such systems, even more than for the more simple systems discussed earlier, the 
dispersion is heavily dependent on the long-wave-length absorption bands and is 
only slightly influenced by the bands lying far out in the ultraviolet. In the event 
that Ao lies in the region of the longer-wave-length spectrum, it may happen that 
\;? — Ao?) is small with respect to \,°, and equations (11) may be replaced by the 


simpler relations 
Ando? = DL aar?z, 
boro? = Sas 
Sid in Sia 


According to the molecular theory underlying the statements made in this para- 
graph, the parameters b,, and the wave lengths A, for which these parameters are 
sizable, should be intrinsic properties of the helical skeleton; they should be in- 
sensitive—as should the constants bo, Ao which they, in large measure, determine— 
to environmental factors such as arise from changes in solvent, temperature, and 
the nature of the side chains, always provided that this helical skeleton is not 


destroyed. 
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We do not, of course, claim that this analysis is any more than formal. Any re- 
sults based on it are not unique. But it is no more possible at the present time to 
give an operational prescription for the validity of the Drude equation (6) than it 
is to say when our equation (10) may be expected to hold. — Less so, for if it is found 
that bo, Ao are indeed invariant properties for a given helical skeleton, under modifi- 
cations affecting a) appreciably, our interpretation of these parameters in terms of 


equations (11) or equations (12) is greatly strengthened. 


EXPERIMENTAL RESULTS 


a) Poly-y-benzyl-i-glutamate (MW, 130,000).—At 20° C. this polypeptide exists 
as a helix in 1,2-dichloroethane, N,N-dimethylformamide, 1,4-dioxane, and chloro- 
form. Its optical rotatory power in these solvents is in sharp contrast to that which 
the polymer displays when dissolved in dichloroacetic acid at the same tempera- 
ture. In the latter solvent the intramolecular hydrogen bonds are destroyed and 
the molecular configuration assumes a randomly coiled form. 

The specific rotation of polybenzylglutamate in dichloroacetic acid decreases 
monotonically from —10° at 7500 A to —65° at 3400 A; the dispersion is simple 
and obeys the Drude equation, with A» in the neighborhood of 1900 A, throughout 
this range of wave lengths. In the “helical’’ solvent, ethylene dichloride, how- 
ever, the specific rotation is around +8'/2° at 7000 A, rises to a maximum value of 
about +10'/.° at 5700 A, and then decreases markedly, attaining the value of 
—181° at 3130 A. The behavior in chloroform, dimethylformamide, and dioxane is 
also anomalous, showing the same qualitative features as the ethylene dichloride 
solution. However, [a] is solvent-dependent: at 4360 A it takes the values +11°, 
—11°, —3°, and +1° for these respective solutions. 

Our results for the “helical’”’ solvents were fitted to equation (10) in the following 
manner. The specific rotations [a] were first converted to effective monomer rota- 
tions [m’], using Sellmeyer’s formula to obtain suitable extrapolations for the dis- 
persion of the refractive indices nv. [m’](A* — Xo?) was then plotted as a function of 
(A2 — Xo?) ~! for trial values of Xo until a straight line was obtained. The procedure 
is illustrated for the ethylene dichloride solution in Figure 1. It is seen that the 
curvature depends fairly strongly on Ao, whose value is therefore determined as 
2120 + 50 A. The slope of the line determines bo, and its intercept on the ordinate 
axis determines ay. For the other solutions \) was found to be 2120 A also, within 
the limits of experimental errors. The results for these are collected in Figure 2 
(solid lines). They may be expressed analytically in terms of equation (10), using 
Ay = 2120 A, together with the parameters assembled in Table 1. It will be re- 
marked that, in addition to the approximate constancy of the critical wave length, 
the slopes of the lines are almost the same for all four solutions: both bo, A» appear 
to be substantially independent of the nature of the solvent, provided the helical 
configuration is maintained. The solvent-dependent term is clearly do. 

Solutions of polybenzylglutamate in 1:3 ethylene dichloride: dichloroacetic acid 
show a most interesting behavior.? At 21° C. the rotatory dispersion is normal and 
much the same as in pure dichloroacetic acid at the same temperature: the poly- 
peptide is randomly coiled. On raising the temperature to 41° C., however, it has 
been shown that the viscosity increases markedly (and reversibly) and that the opti- 
cal rotatory power becomes anomalous: the helix is now the stable form. At the 
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higher temperature the effective monomer rotation is very closely fitted by means 
of equation (10) with Ay» = 2120 A, as before; bo is almost identical with that for 


solutions in pure ethylene dichloride at 20° C., although the intercept parameter ao 
is different. Their values are listed in Table 1. 


TABLE 1* 
PARAMETERS DESCRIBING THE OptTicaL ROTATORY 
DISPERSION (DEGREES) 


Poly-y-benzyl-L-glutamate in 

Ethylene dichloride 

Dioxane 

Chloroform 

Dimethylformamide 

1:4 Ethylene dichloride: dichloracetic acid (44°C) 
Poly-a-L-glutamic acid in 

Dimethylformamide 

1:2 Dioxane: water (0.2 M NaCl, pH 4.72) 


* x 2120 A in all cases; 7 20° C. unless otherwise stated. 


b) Poly-a-t-glutamic acid (M.W., 35,000).—Again at 20° C., polyglutamic acid 
exists as a helix in dimethylformamide. Its optical rotatory dispersion is anomalous 
in this solvent, changing sign near 4900 A to assume the large negative specific 
rotations at shorter wave lengths which characterize its benzyl ester. In 2:1 mix- 
tures of 0.2 M aqueous sodium chloride:1,4 dioxane its behavior is strongly pH- 
dependent: at pH 6.56 the repulsions between the ionized carbonyl groups break 
down the helix and the configuration is that of an extended random coil; the rota- 
tory dispersion is normal and closely follows the Drude equation with A» ~ 2100 A. 
On acidifying to pH 4.72, however, the helical configuration is attained reversibly 
and the optical activity displays anomalous dispersion once more. 

Both in the aqueous solution at pH 4.72 and in dimethylformamide, [m’] is 
found to obey equation (10) with A» = 2120 A. On plotting [m’](A? — Ao”) against 
(A? — Xo?) ~!, straight lines are again obtained for this critical wave length; these 
are shown as the broken lines in Figure 2. The parameters ao, bo are listed in Table 
1. The value of by for the aqueous solvent falls in nicely with the similar values ob- 
tained for polybenzylglutamate, and that for the dimethylformamide solution is 
within 10 per cent of these: the slopes of the lines in Figure 2 are all much the same. 
It therefore appears that the constants bo, Ay do not depend appreciably on the na- 
ture of the side chains attached to the a-helix. More data are, of course, required 
to establish this more firmly, but these must await the preparation of further poly- 
peptides. 

It has tacitly been assumed that the helical sense of both polypeptides is the 
same, Since the acid may be prepared from its benzyl ester in anhydrous benzene 
solution,® it seems unlikely that the a-helix should be destroyed and then rebuilt 
during the process. On forming the sodium salt, in aqueous solution, of the ma- 
terial so obtained, it attains the coiled conformation from which, however, it is 
possible to recover the acid in its helical conformation with no net change in its 
optical rotatory power. 

The experimental details of the studies summarized in these paragraphs will be 
published elsewhere. The measurements ranged from 7500 A at the longer wave 
lengths down to 3340 A in the ultraviolet. It was found that, in each case, the 
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observed rotations over this range could be fitted by the parameters in Table | 
to within 4 per cent or to within the experimental error—whichever is the smaller. 


Our samples were kindly provided by Dr. E. R. Blout. We are grateful to him 
and to Dr. P. Doty for stimulating discussions. The purchase of a Rudolph Model 
200 S-80 spectropolarimeter was assisted by a generous grant from the William F. 
Milton Fund, Harvard University. 

1W. Moffitt, J. Chem. Phys., 24, 1956. 

2 P. Doty and J. T. Yang, J. Am. Chem. Soc., 78, 498, 1956. 

3P. Doty, A. Wada, J. T. Yang, and EF. R. Blout, personal communication, not yet published. 

‘ L. Pauling and R. B. Corey, these PRocEEDINGs, 37, 241, 1951; P. Doty, A. M. Holtzer, J. H. 
Bradbury, and E. R. Blout, J. Am. Chem. Soc., 76, 4493, 1954; P. Doty, J. H. Bradbury, and A. M. 
Holtzer, J. Am. Chem. Soc., 78, 947, 1956; E. R. Blout and A. Asadourian, J. Am. Chem. Soc., 
78, 955, 1956. 

5 E. R. Blout and M. Idelson, J. Am. Chem. Soc., 78, 497, 1956. 

6 W. Moffitt, these PRocEEDINGs, Vol. 42, 1956 (in press) 
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ISOTROPIC MATERIALS WHOSE DEFORMATION AND DISTORTION 
ENERGIES ARE EXPRESSIBLE BY 
SCALAR INVARIANTS 
By T. Y. THomas 
DEPARTMENT OF MATHEMATICS, INDIANA UNIVERSITY 
Communicated July 16, 1956 
1. Introduction.—The deformation energy F per unit mass of an isotropic 
material with symmetric stress and rate-of-strain tensors given by o and e, respec- 
tively, is determined by the equation 
dE 
= if de, 1) 
Pa e9 
where p is the density and ¢ denotes the time. If we define the deviation tensors 
o* and e* by writing 
l 


o*ag Cag oS Abas; €* ag = o- - 3 tre bag, 
” . 


with 6 = tr 6, then 


I 
tr de = tr 6*e* + | Otre; 
” 


where we have put £ = cagoag and &* = o*ago*ag. Constructing the relations 


de* = de 1 
—_ , . = 6 tr re 1) 
pa eee: Pit q ire ( 
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it follows from the first equation (3) that F can be represented as the sum of the 
functions e* and e. In accordance with the usual interpretation, we consider e* 
to be the energy per unit mass due to distortion, i.e., change of shape as distinct from 
change of volume, while the function e is the energy per unit mass associated 
solely with change of volume. We shall call e* the distortion energy. Since e = 


E — e*, the simultaneous existence of the energies EF and e* implies the existence 
of the energy e. 

In a recent paper entitled “Deformation Energy and the Stress-Strain Relations 
for Isotropic Materials’! we investigated the conditions which must hold in order 
that the deformation energy FE be expressible as a scalar invariant of the tensors 
o and e when we assume a system of stress-strain relations of the form 


Doag 


a Sex ( ’ ; 1 5) 
Dt pe es" - 


where the f’s are the components of a tensor invariant under proper orthogonal 
transformations relating moving rectangular co-ordinate systems, and the quan- 
tities Doag/Dt are the components of the absolute time derivative? of the stress 
tensor. In particular, we considered a special class of relations (5) characterized 
by the requirement that the quantities fag were linear in the components ¢ag and 
at most quadratic in the cag with coefficients depending on the density p. Such 
systems were called preferred for convenience of reference. It was shown that 
there is a single class of preferred systems with energy FE of the form y¥(é + k6?)/p 
and at most two classes of physically significant preferred systems for which FE has 
the form $(& + /6*), where k is a constant. Denoting these classes by ‘Case A,”’ 
“Case B,” and “Case C,” respectively, the associated energies F are given by 


NE 
_ ) (for Case A), (6) 
3A+ 2yu 


re? ‘ 
— (for Case B), (7) 
SA+ 2b 


¥ [ : (: “8 )] for Case), (8 
og ao _ (ior Case ©), ) 
Pit K 3\+ 2p een each 


where A, u, and K are constants. Expressions (6) and (7) have the formal char- 
acter of the expression for the deformation energy in the ordinary elasticity theory. 
Expansion of the right-hand member of expression (8) shows that the same is also 
true to a first approximation for Case C systems. 

It is the purpose of this communication’ to deduce the further restriction which 
must be imposed on the preferred systems admitting the energies (6), (7), and (8) in 
order that these systems will also have distortion energies e* which are scalar in- 
variants of the stress tensor ¢. The precise results are given in the following sec- 
tions and will not be stated here. However, it will be emphasized that the condi- 
tions on the Case C system reduce this system to a simple generalization of the 
Prandtl-Reuss equations of the theory of plasticity. As a consequence, the Prandtl- 
Reuss equations appear essentially as a condition for the energies E and e* to be ex- 
pressible by scalar invariants (see last equation in Sec. 4). 
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Case A Systems.—For this case the matrix f of the components fag is given by 


} AB + AB102 + (BA + 2 
+ NazE — (BA + Zy)Bé] fre — [(BA + Zw)ay + (BA + 2p) ad + Aa;8] lr ée | T/2Qu 


I , 
+ [Bo + BO - ( — 86) tre — az tr belé + (2 u + a6 + af + azt)e, (9) 
where J is the unit matrix and 6 and e are the matrices of the components gag and 


€ag, respectively. The quantities \ and uw in expression (9) are constants, while 
the a’s and #’s either may be constants or may depend on the density p. Now 


from equations (2), (8), (5), and (9) we have 


Do*a l l 
x i. Bo + B10 — tre + pe — = as ) 6tre — agtr é6*e* | o*a3 
Dt 2 3 


: | a 
+)2u+a0+ ( a +- 3 az} 6? + azt* €*ag. (10) 


Also, using the second relation (3), we observe that equation (6) can be written 
| : 2 pb? 
t 4.428 (11) 
t up 3(3 A + 2 p) 
The separation of the quantities * and 6? on which the energy E depends, accord- 
ing to equation (11), suggests that we should have 
g2 
> é : = (12) 
1 up 6(3 A + 2 w)p 
Substituting this expression for e* into the first condition (4), making use of the 
equation of continuity 
dp 
dt 


+ ptre , (13) 
and equating to zero the coefficients of the different terms in the resulting equation, 
we find that 


Bo = Bi : 2 = ——* =: 3 = (14) 


where we have introduced the quantity A, which, in general, may be a function of 


p. Hence equation (10) becomes 


D o*ag E* 2 M 
Dt =2u{1+ = t*ag — 5 ret K tr d*e* | o*ag. 


From equation (5) and the trace of the matrix f, we now find 
dé = l ia 
=(d3rA+ 2y) tre — = Otre. (16) 
dt 2 


Relations (15) and (16) are equivalent to the stress-strain relations (5) with the 
f’s given by equation (9) under the conditions (14). Moreover, since the second 
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relation (12) is an automatic consequence of relation (11) and the first relation (12), 
we have now proved the following result. The stress-strain relations (15) and (16) 
admit the invariant energies FE, e*, and e given by equations (6) and (12). From the 
derivation of this result it is clear that equations (15) and (16) constitute the most 
general preferred system for which equations (6) and (12) give the energies FE, e*, 
and e. 

3. Case B Systems.—The matrix f of the preferred system (5) admitting the 
deformation energy FE given by equation (7) has the form 


{n8,02 — (3A + 2u)BE + [2 uA + Aa + A(ae + Bo)O? + Aast 


— (3X + 2yu)Pt] tre — [BX + 2u) (a + a) + Aas] tr de} = 
Zu p 
6 e £ ms 
+ (8,0 + B00 tre — aztr ée) + (2 + a0 + af? + ast) —, (17) 
p p 


where, as before, \ and uw are constants and the a’s and §’s are functions of the 
density p. Corresponding to the italicized result in Section 2, the following result 
can now be shown. The system of stress-strain relations 


Do*ag 2b ( 


t* » 
, - Hb , 4 
+ * ) e*ag — — tr 6*e* o*ag, (18) 
Dt p 


K pk 


16 + 2 
; Zs “) tr e, (19) 


dt p 


where K may depend on the density p, admits the deformation energy EF. given by equa- 
tion (7) and also the energies 


6? 


: e=- i (20) 
m 6(3A + 2p) 


The system (18) and (19) is, of course, a special case of the system defined by equa- 
tions (5) and (17) and is, moreover, the most general preferred system admitting 
the above energy invariants. 

4. Case C Systems.—The most general preferred system admitting expression 
(8) for the deformation energy F is given by relation (5) and the matrix 


f = }r\Be? — (BA + 2 yw)HE + [2 wrA + AeA + A(az + B2)O? + Nase 


— (3A + 2 1) Bok | tre — [(3 A + 2 ML) (ay + a) + a;6 | tr ée} 
2 up 


6 
+ (6,0 + 6.0 tre + B; tr be) - + (26+ a6 + af? + azé) 
) 


/ p 


in which 


2u 


a; + Bs = — K 


Here K is a constant, i.e., it is independent of p. The quantities \ and y are again 
constants, while the a’s and §’s may be taken to be functions of p subject to the 
above relation in a3 and 8; by which the constant K is introduced. From equation 
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21) we now have 


Da* 2 l e 
= | BO + | Bo+ = Bs) Otre + Bs tr d*e* |a*ag 
Dt 3 : 


l 
+ 2 w+ ow + (« +- = as) e 


—K l 
log} 1 —- — 
bu 


of relation (8) suggests that 


The form 


f(Z), Z= =, (24) 


Making this substitution in the first equation (4), we can deduce the following rela- 


tion: 


f’ l 
~— 2| a0 + (0, oa : a) O6tre + B; tr d*e* ke 
K | 3 


| Sk 
+ a2 wh + af + («: +3 as) 6? + ct | tr 6*e* t = tr 6*e*, 
» 


Since this relation must be satisfied identically in the variables 6, &*, tr e, and 


tr 6*e*, it follows that 


a = p= 0; 2 - 35 ti 5 93 (25) 


tuZ d 
ae (26) 
K dz 


Integration of this latter relation (26) now gives 


K Yy pan 
—— log{1 ——}], (27) 
bu \ 


where the constant of integration is chosen so that e* = O for &* = 0, Le., for 


vanishing stress. 
Using equations (25), the stress-strain relations can now be expressed in the 


form 


* ‘ . ‘ 
Do ap 4 S ‘ =~ -b 


= 4 - (1 + a) tr 6*e*o* 2, (28) 
Dt ok s 


dé oA + : 2 
) tre — 2 0 tr ée, (29) 
dt p I pk 


where a@ can be taken to be a function of the density.‘ Hence we have arrived 
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at the following result. The stress-strain relations (28) and (29), in which X, pw, K 
are constants and a is a function of the density p, constitute the most general preferred 
system admitting the energies EF and e* given by equations (8) and (27). The energy e 
is given by the difference # — e* of the deformation and distortion energies. 

If K > 0, we see that the quantity in parentheses appearing in the right-hand 
member of equation (23) cannot attain the value A, since this would lead to an 
infinite value of the deformation energy F. In particular, this implies that the 
quantity €* cannot reach the value K, as can, in fact, be seen directly from relation 
(27). To overcome this difficulty, we consider the energy expressions (8) or (23) 
and (27) for K > 0 to be valid in the physical sense only for sufficiently small values 
of the stress, i.e., we regard these formulas as approximate, with range of applica- 
tion limited to displacements which do not deviate too greatly from the natural 
or unstrained state of the material. No such restriction need be imposed for 
negative values of the constant K. 

Taking a = 0 in equation (28), we have 


Do* 26 - =#( - tr 6*e* 2 ) 
€ ap y OT ap |}- 
Dt p K 


These relations are identical with the Prandtl-Reuss equations of the theory of 


plasticity, apart from the occurrence of the quantity pin the right members. More- 


over, equations (28) admit the yield condition &* = K in the sense that this rela- 
tion is left invariant under a motion governed by equation (28). The stress-strain 
relations (28) and (29) can therefore be regarded as applicable both for smail 
stresses and for large stresses capable of producing the plastic state as characterized 
by the usual quadratic yield condition.° 


” J Math. and Phys , to appear. 

2 T. Y. Thomas, “‘Kinematically Preferred Co-ordinate Systems,” these PROCEEDINGS, 41, 762 
770, 1955. 

’ This investigation was conducted at the request of the Naval Research Laboratory and has 
been supported by the Office of Naval Research under Contract Nonr-908(09), NR 041 037, with 
Indiana University. 

* No attention has been paid to the question of the dimensions of the various terms which make 
up the preferred system. In this connection we observe that the @ in eq. (28) should be a quan- 
tity having no physical dimension, a condition which can be satisfied most readily by choosing 
a to be an absolute constant independent of the density. 

5 The statements in this paragraph refer only to certain formal aspects of the system (28) and 
(29) and are not to be construed as meaning that these equations are necessarily suitable for the 
description of plastic behavior. 





CYTOLOGICAL DEMONSTRATION OF THE HELICAL STRUCTURE 
OF GIANT CHROMOSOMES 


By WiLuiAM HovanitTz 


CALIFORNIA INSTITUTE OF TECHNOLOGY, PASADENA, CALIFORNIA 


Communicated by F. W. Went, July 11, 1956 


Alverdes! as long ago as 1912 interpreted the bands of the giant chromosomes 
found by Balbiani? in Chironomus as the result of spiralization. Later, Kaufmann,’ 
applying plant-chromosome techniques to Drosophila giant chromosomes, concluded 
that the discoid or discontinuous appearance of these chromosomes was an ex- 
pression of inadequate technique and that a spiral structure was the more ac- 
curate. Nevertheless, the discontinuous structure appeared to fit genetic data 
more easily. 

Evidence from embryology* ® indicates that the bands or ‘‘disks’’ of the giant 
chromosomes of the Diptera have their origin in the turns of a helical (spiral) 
structure. In view of the difficulty of discerning connections between the bands, 
however, this evidence lacked the necessary proof that the fully formed chromo- 
somes were themselves spiral; in fact, it had been postulated that the coils actually 
broke to form rings. Evidence of a cytological nature has also been given’ that the 
bands of the giant chromosomes are composed of the turns of many small helices 
which together appear to be a band. These data have been marginal in optical 
resolution, however. Other evidence’ of a cytological nature depends largely upon 
a characteristic resolution of the microscope which is of uncertain meaning. Serial 
sections of Drosophila chromosomes* studied with the electron microscope appear 
to indicate that the structure is that of a hierarchy of helices, which appears to be 
similar to that found with isolated chromosomes.® 

In the course of studying Drosophila giant chromosomes for the purpose of at- 
tempting to detect connections between the bands, a portion of one chromosome 
was found which showed a considerable area in which the bands actually were 
connected as in a helix. Further study has revealed that these connections are 
not uncommon, but it is rare to find one as extensive as this. To illustrate the 
helical nature, the strand was photographed from top to bottom at different focal 
levels by changing the fine focus adjustment of the microscope. The series of 
prints obtained does indeed show that the bands of the chromosome are in reality 
turns of helices and that the bands of the diploid or paired chromosomes are in fact 
composed of regularly arranged turns of two helices. 

The series of prints is shown in Figure 1. Starting with one focal level at one 


end and proceeding to the other end, the turns of the helices can be followed all 


around the circumference of the coils between points a and b on the chromosome. 
In prints A—F the strands are slanted in the reverse direction from those in prints 
J-K. The central section of the helix, prints G-/, shows portions in both diree- 
tions, as would be the case were one actually looking at a true helix. 

The groove separating the paired homologues of the diploid chromosome is seen 
slightly below the middle. At H the spiral of one half of the chromosome is in- 
dicated to be independent of the spiral of the other half. In Figure 2 the same 
chromosome is shown flattened further by compression between the slide and 
coverslip. Being in a flatter field, the strands are seen over most of their distance 
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Fig. 1.—Part of a chromosome of Drosophila melanogaster 
(stained and fixed with aceto-orcein and smeared on a 
slide), photographed at successive focal-plane levels from 
top to bottom. Oil immersion obj. N.A. 1.32 
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as they traverse a helical course and thus are seen to be proceeding in both direc- 
tions at the same time. A model showing two helical wires representing the paired 
homologues is shown in Figure 3. Owing to the great depth of focus of the camera 
lens, all portions of the wire model are in focus at the same time, the lower parts 
having been shaded so as to indicate depth. However, in Figure 3, B, the upper 
portions of each coil are darkened; the appearance then is that of the chromosome 
pair in Figure 1, A-F. In Figure 3, C, the lower part of each coil is darkened and 
the appearance is that of the chromosome in Figure 1, J—K. 


Ay 


Fic. 2.—Same chromosome as in Figure 1 after a greater pressure 
on the coverslip. Note the further stretching between the bands. 


From this, the larger bands of the diploid giant chromosomes are shown to be com- 
posed of the combined coils of the synapsed haploid strands. Furthermore, the 
bands are continuous from band to band, being connected together as a double 
helical structure, with the coils symmetrically arranged so that the “‘bands’’ ap- 
pear to be paired. The close compression of the coils in the usual preparation makes 
it impossible to focus clearly on anything but the chromosome surface, which there- 
fore looks like a series of disks or bands (Fig. 4, a). In the model of a single helical 
wire (Fig. 4), the distance between gyres is indicated to be either compressed (as 
at a ore) or in varying degrees of looseness (as at b-d). The loose areas show struc- 
ture comparable with Figure 1 but fail to show the possible breakage of coils (as at 
c, d, or ein Fig. 1). A heavy band such as that at e in Figure 1 might arise as a 
tight concentration of coils as shown at e in Figure 4. However, there is another 
possible explanation to account for the apparent discontinuity between some bands, 
besides breakage, namely, the stretching of the strands so that they are below the 
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F 1G. 3.—Metal models of helices arranged to show 
the appearance of a double helix (A), the same with 
the upper part of each coil blacked out (B), and the 
same with the lower part blacked out (C). 





VoL. 42, 1956 GENETICS: GARNJOBST AND WILSON 


\/\ YN 


a b Cc d & 


Fic. 4.—Model of a single helix with variable distances between the coils to 
illustrate the possible similarity to the giant chromosomes, 


resolving power of the microscope and are thus invisible. This point will be brought 
out in a later paper that will consider the multiple-strand nature of the chromosome 
and of their hierarchy of helices. 


The author wishes to express his gratitude to Dr. Taylor Hinton and Mr. W. 
Goodsmith of the Department of Zoélogy, University of California, Los Angeles, 
for their aid in supplying laboratory space, equipment, and the preparation shown. 
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HETEROCARYOSIS AND PROTOPLASMIC INCOMPATIBILITY 
IN NEUROSPORA CRASSA* 


By Laura GARNJOBST AND JAMES F. WILSON 
DEPARTMENT OF BIOLOGICAL SCIENCES, STANFORD UNIVERSITY, STANFORD, CALIFORNIA 


Communicated by E. L. Tatum, June 28, 1956 


Introduction.—The physiological basis of incompatibility between organisms is a 
subject of considerable interest. Although incompatibility reactions are known 
in less complex forms, they have been studied most intensively in the higher organ- 
isms—for example, the researches concerned with outbreeding in flowering plants 
and blood types in animals. Among the micro-organisms, certain incompatibilities 
described in the past are related to sex, e.g., the mating-type factors of some Pro- 
tozoa and fungi. In contrast, incompatibilities recently described in strains of 
Neurospora crassa appear to be related primarily to the vegetative stages. This 
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type of incompatibility is manifested by the failure of certain strains of like mating 
type to form successful heterocaryons. 

Several biochemical mutant strains of NV. crassa differing in ability to form hetero- 
caryons have been investigated genetically.'' ** Isolates from crosses made in 
this laboratory with strains of the CD and cd heterocaryon genotypes provided 
suitable material for further study of heterocaryosis. Since hyphal fusion ordi- 
narily is a prerequisite for the experimental production of heterocaryons, an answer 
to the following question seemed important. Do the genes which inhibit hetero- 
caryosis in certain unisexual combinations affect some steps in the fusion process, 
or do they affect some subsequent stage? Microscopic observation of growing 
hyphae showed that hyphal fusions do occur between all pairs of isolates that fail to 
form stable heterocaryons, as well as between all pairs that form stable hetero- 
varyons. In the former pairs, however, fusion between hyphae is followed by a 
series of visible changes whereby the cells involved are destroyed. It is the pur- 
pose of the present paper to describe this “incompatibility reaction.” 

Materials and Methods.—Isolates of the snositolless (87401, anos) and riboflavinless 
(Y30539, rib-2) strains obtained in the genetic analysis previously described? * 
were used for most of the microscopic observations. The heterocaryon genotypes 
of the isolates of each biochemical mutant strain are CD, cD, Cd, or ed. Pairs of 
isolates (nos + rib-2) which give wild-type rate of growth on minimal medium 
(het+ response) have the same heterocaryon genotype, i.e., CD + CD, cD + cD, 
Cd + Cd, and cd + ed; those which give no growth or delayed erratic types of 
growth on minimal medium (hef~ response) have unlike heterocaryon genotypes, 
i.e., CD + cD, CD + Cd, CD + ed, cD + Cd, cD + ed, and Cd + cd. Other strains 
and isolates will be mentioned in connection with the particular tests concerned. 

Material for examination was prepared under sterile conditions as follows. The 
cultures to be tested were grown at 30° C. in Petri plates containing a thin layer of 
glycerol complete medium‘ with 2.5 per cent agar. When the mycelium from a 
central inoculation of conidia had reached about halfway toward the periphery of 
the plate, small squares were cut along the advancing hyphal front for transfer to 
slides. Previous to the transfer, a small rectangle of glycerol complete agar (2.5 
per cent) from an uninoculated plate had been put on each slide within a moistened 
Petri plate. The squares of agar with hyphae from each of the two cultures to be 
tested were then placed at opposite ends of the rectangle, so that the hyphal tips 
pointed toward the central ‘“‘bridge.’’ Such slide preparations, six for each pair, 
were incubated at 34° C. until the hyphae growing toward one another on the 
surface of the agar were about 0.3 mm. apart. A drop of minimal medium and a 
cover slip were then added for observation under the microscope at room tem- 
peratures (20°-28° C.). 

The best results were obtained when the agar bridge and squares with hyphae 
were equal in thickness. A narrow bridge facilitated tracing the fused hyphae to 
their origins and allowed diffusion of oxygen. To support the cover slip and reduce 
evaporation, small squares or strips of agar were added to one or both sides of the 
bridge. 

Both phase-contrast and ordinary compound microscopes were used. A magni- 
fication of 645X was found to be adequate for the observations presented. 

Results.—As a basis for this study, the wild type SY7A was examined first. 
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Many types of intrastrain fusions were observed from the time of contact until 
protoplasmic flow took place through the new opening. The types of fusions 
(Figs. 1 and 2) are quite similar to those described in Neurospora and in other in- 
completely septate fungi.® 

The interstrain hyphal fusions of the het*+ pair CD inos and CD rib-2 (in a + a or 
A + A mating types), as well as the intrastrain fusions of each of these isolates, 
were found to be indistinguishable from the intrastrain fusions observed in the wild 
type, as might be expected. The same was true of the intrastrain fusions of each 
member of the het~ pair of isolates, CD rib-2 and ed inos (ina + aor A + A mating 
types, and in the reciprocals, CD inos and ed rib-2), and also of the interstrain fusions 
up to and including formation of an “‘interseptum.” Thereafter, a series of visible, 


irreversible changes was initiated. 

To facilitate a description of these changes, one hypha is termed the “attacking 
hypha”’ and the other the ‘quiescent hypha.” However, in most of the fusions 
observed the quiescent hypha responded by forming a protuberance (Figs. 1 and 


9), and both hyphae often were equally active, i.e., visibly advancing toward one 
another on the agar surface. The subsequent series of changes occurred in the 
following order. 

1. Shortly after contact (from 2 to 30 minutes or more at room temperatures of 
from 25° to 28° C.) protoplasmic movement took place through the new opening. 
The flow in some instances lasted about 45 or 60 seconds, occasionally somewhat 
longer. However, a mere start of movement or no perceptible movement at all 
were also common. 

2. With or without movement, the protoplasm in the vicinity of the fusion, char- 
acteristically in both hyphae, sooner or later became gradually more granular, 
then finely vacuolated (Fig. 6). The vacuoles increased in size, often becoming 
quite large, as shown in Figures 7, 8, and 9. At the same time the intervacuolar 
portion became dense (Fig. 9). The conspicuous, affected region varied in extent 
from one to several cells in each hypha and usually was sharply delimited by septa, 
each containing a distinct opaque disk or plug (Figs. 5, 6, and 10) in place of an 
opening. At other times two or three consecutive septa in the outer cells were 
plugged. When this occurred, the outermost cells frequently showed a delayed 
effect or a graded effect, i.e., less granulation and vacuolation distally. The plugs 
were most prominent at the height of the vacuolation. 

3. Finally, the vacuoles faded from view, leaving the contents of the affected 
cells pale and disorganized. Many hours later shrunken protoplasmic remnants 
could still be seen within the old hyphal walls. 

During the late stage (stage 3) of the incompatibility reaction, the normal cells 
immediately behind the outermost plugged septa often developed new growing 
points (Figs. 10 and 11). These points formed new branches or instead grew within 
the old cell wall, pushing aside the plug, now partly resorbed, as well as the disin- 
tegrating protoplasm. When the latter course was taken, two growing tips within 
the wall of the old quiescent hypha were observed to approach one another and 
fuse, thus reconstituting this hypha (Fig. 12). New hyphae growing within attack- 
ing hyphae discontinued growth or broke out through the wall before reaching the 
point of union. Figures 11 and 12 illustrate the two types of regeneration observed. 

The important result of the fusion and interaction of the two protoplasms appears 
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to be that the affected region in each hypha is effectively destroyed. However, 
the methods employed in this study do not rule out the possibility that some of the 
components of the invading protoplasm may escape into the adjoining unaffected 
region. 

Representatives of all other het*+ and het~ pairs of isolates (nos + rib-2) pre- 
viously described were inspected microscopically. The results obtained with all 
het* pairs were essentially the same; that is, the intra- and interstrain fusions ob- 
served were indistinguishable and like the intrastrain fusions of the wild type 
SY7A. The difference between the intra- and interstrain fusions, as described in 
detail for the het~ pair CD rib-2 and cd inos, also holds for all other het~ pairs 
tested; that is, all interstrain fusions induced the incompatibility reaction. No 
perceptible difference was observed in the reaction between pairs of strains differing 
by one or by two heterocaryon genes. 

In a previous paper*® it was shown that the effect of the genes concerned with 
heterocaryosis is independent of the mutant genes ines and rib-2 and of all other 
biochemical mutant genes examined. Thus it seemed logical to assume that the 
incompatibility reaction would take place following fusion between two isolates of 
the same biochemical mutant strain differing in heterocaryon genotype. This was 
actually found to be the case. Representatives of all types of hef~ pairs of isolates 
previously described were tested. 

The results of heterocaryon tests (unpublished) made in our laboratory with 
certain biochemical mutants suggested that the wild types from which they were 
derived differ in heterocaryon genotype. These wild types are SY7A or SY4a and 
ST A or ST a (St. Lawrence strains).? Since heterocaryon formation of nutri- 
tionally wild-type strains cannot be determined by the usual methods, it was 
thought that a difference might be detected microscopically. To test this possi- 
bility, the following pairs of wild types were observed: (1) SY4a fy + Lindegren 
(L) 25a; (2) ST a + SY4a fy; (3) ST a + L25a; (4) pe” fl a (Y8743-21 (13-7), 
microconidial fluffy) + SY4a fy; (5) pe” fla + ST a; (6) pe” fla + L25a; and (7) 
SY4a fg + ST a. Consistent results were obtained as follows: (a) All strains 
formed functional intrastrain fusions in all preparations. (b) In preparations in- 
cluding a St. Lawrence strain, the interstrain fusions were of two types—completed 
fusions initiating an incompatibility reaction and uncompleted fusions apparently 


resulting from failure to produce an opening at the point of contact. (c) All other 


pairs were found to be compatible according to the new test. 
The mating-type factors A and a in Neurospora have also been called incom- 
patibility factors.’ Since the function of these allelic genes is not well understood, 


Frias. 1-12.—Photomicrographs of living, unstained hyphae of Neurospora crassa growing on 
nutrient agar. 600. Arrows indicate fusions not obvious in photomicrographs. 

Fias. 1 and 2.—Intrastrain fusions. 

Figs. 3 and 4.—Interstrain fusions, het* pairs. 

Figs. 5 and 6.—Early stages of the incompatibility reaction, het~ pairs. Plugs visible in septa 
delimiting affected regions. 

“tas. 7, 8 and 9.—Later stages of reaction showing prominent vacuolation: Fig. 7, in wild- 
type pair differing in mating type; Fig. 8, in pair differing in both mating type and heterocaryon 
genotype; Fig. 9, in pair differing in heterocaryon genotype. 

Fic. 10.—Later stage of reaction in which vacuoles remained small. New growing point at 
lower left. 

Figs. 11 and 12.—Types of regeneration. 
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it seemed of interest to extend the new test to strains differing in mating type. The 
pairs selected for these tests were CD rib-2 a + CD inos A, CD inos a + CD inos A, 
and CD inos A + cd inos a. In these tests interstrain fusions were infrequent in 
some slide preparations but common in other preparations of the same pair. When- 
ever the fusions did occur, an incompatibility reaction was initiated. The series of 
changes in most instances was similar in all respects to that described for het~ pairs 
of the same mating type, but in the first two pairs listed an occasional difference was 
noted. This difference consisted of an unusually long period of protoplasmic flow 
through the interconnection, followed by a delay in the appearance of the reaction. 

This unexpected evidence of incompatibility led to still further tests, this time 
with pairs of wild-type strains differing in mating type, as follows: (1) SY4a fy + 
SY7A; (2) SY7A + L25a; (3) SY7A + ST a; (4) ST A + ST a; and (5) ST A + 
L25a. As in the tests mentioned above, the interstrain fusions observed between 
these A and a strains also initiated an incompatibility reaction (Fig. 7). The vari- 
ability described above was also found in pairs 1, 2, and 4. In addition, in pairs 
3 and 5a few fusions appeared to be uncompleted. 

Although no constant difference in the incompatibility reaction was observed in 
a + A pairs as compared to a + a or A + A pairs, the results of the preliminary 
tests with strains differing in mating type suggest that the incompatibility reaction 
in these combinations is more readily affected by environmental conditions. 

Discussion.—These findings raise certain questions regarding heterocaryosis, mat- 
ing-type factors, and particularly regarding the nature of the incompatibility reac- 
tion. It has been concluded from a variety of experiments that functional anasto- 
moses do not occur between mycelia of different species.® * The Neurospora species, 
crassa, tetrasperma, and sitophila, were among those tested.6 The incompatibility 
reaction described in this paper, then, might be looked upon as similar to one of 
the many possible genetically controlled barriers leading to the evolution of sexual 
differentiation or speciation. This irreversible reaction appears to be an effective 
mechanism to prevent heterocaryosis in either unisexual (hef~) or bisexual'® !' com- 
binations, but in the A + a@ combinations the stimulus inducing the reaction might 
be present only during the vegetative stages. 

Numerous observations concerning hyphal fusion in fungi during vegetative 
growth and reproduction have been reported in the literature. Some of the findings 
described in this report are not new—for example, the open “‘pore’’!? in the center 


“é 


of each hyphal septum, the “ pore-plug’’!? which appears as a result of injury to one 
or more cells in a hypha, and the growth of new hyphae within the old hyphal 
walls.'* Most recently Gerdemann'* described, in a phycomycetous fungus, a 
method of wound-healing which differs slightly from that observed in a number of 
species by Buller’? and in Neurospora during the present study. These injury 
reactions resemble the incompatibility reaction but differ from it in one important 
respect—the absence of interhyphal fusion and subsequent intermingling of in- 
compatible protoplasm. 

In Protozoa there is at least one recorded observation of protoplasmic incom- 


patibility not related to sex. Reynolds’ ‘shattering reaction’ in 


investigated a 
Arcella polypora and concluded that this reaction was a transitory change in pheno- 


type induced by environmental differences. In contrast, the heterocaryon pheno- 


types of the Neurospora strains used in this study have remained consistent with 


their genotypes. 
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In comparing the C and D (or ¢c and d) genes to the incompatibility factors of some 
higher fungi, as seems inevitable, it should be borne in mind that the strains of 
Neurospora obtained in the genetic analysis” * are either of the a or A mating type 
as tested with SY7A and SY4a (or ST A and ST a) and form perithecia when and 
only when the a and A strains are crossed, regardless of the heterocaryon genotype. 

At this stage of the investigation in Neurospora, little is known of the relationship 
between the incompatibility reactions occurring (1) in pairs composed of strains 
of the same mating type and (2) in pairs composed of strains of different mating 
types. However, the results obtained with the various het+ and het~ pairs appear 
clear cut. The agreement between the heterocaryon tests previously made and 
the occurrence of the incompatibility reaction in all hef~ pairs is complete. Fur- 
thermore, the observations indicate that the cause of the incompatibility reaction 
lies in the protoplasmic portion of the mycelium. 

Summary.—The genes C and D and the alternate alleles ¢ and d in N. crassa are 
concerned with heterocaryon formation, as previously reported. Heterocaryon 
tests with pairs of isolates of the same mating type proved that all pairs of the same 


heterocaryon genotype form compatible heterocaryons, whereas all other pairs are 
incompatible, i.e., fail to grow or grow erratically on minimal medium. These two 


classes of pairs were termed het* and het~, respectively. 

The present experiments indicate that the reaction between members of all 
het~ pairs involves protoplasmic incompatibility. Representative isolates of both 
het+ and het~ pairs of the riboflavinless (Y30539, rib-2) and the inositolless (37401, 
inos) strains were observed microscopically during vegetative growth, with results 
as follows: In the het~ pairs all interstrain hyphal fusions initiated an irreversible 
incompatibility reaction. Upon exchange of protoplasm through the new opening 
between the hyphae, the cells in the vicinity of the fusion gradually disintegrated. 
Since no exception was found in any het~ pair and all interstrain hyphal fusions 
in all het*+ pairs were normal, it is concluded that the incompatibility reaction is 
the mechanism whereby heterocaryotic growth in the het~ pairs is inhibited or 
prevented. 
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Introduction.—Genetic recombination of the type first described by Lederberg 
and Tatum! in Escherichia coli, strain K12, has been shown to involve conjugation? 
between sexually differentiated cells (F+ or Hfr with F—).* 4 Unilateral gene 
transfer from Hfr (high frequency of recombination) to F— cells is suggested by 
the observation that recombinants occur only among the progeny of the F— ex- 
conjugants.2»4 In recent experiments by Wollman and Jacob,’ Hfr:F— conju- 
gants were separated by rapid stirring in a Waring Blendor, and then recombinants 
arising from the transfer and incorporation of two linked Hfr genes (7'+ and L+) 
were selected by plating on appropriate minimal medium. It was found that, 
among these selected recombinants, the frequencies of several other, unselected 
Hfr genes increased with the duration of conjugation before the parental pairs were 
separated. These results indicate that all genes are not transferred simultaneously 
and suggest that, under these conditions, partial genomes may be transferred from 


Hfr to F— cells. Further interpretation, however, is limited by the techniques 


employed. First, owing to the exclusive use of selected recombinants, the existence 
of a single, or preferred, sequence of gene transfer is not clear. It could be argued, 
for example, that during conjugation the probability of multiple gene transfer in- 
creases, but that no specific gene is regularly transferred earlier than another. In 
the present investigation the analysis of unselected recombinants has revealed that 
gene transfer is indeed oriented. Second, the method of disrupting pairs is suffi- 
ciently drastic to raise the possibility that the inferred transfer of partial genomes 
may not characterize “‘normal’’ mating.® In the present study this possibility has 
been minimized by the observation of exconjugants which originate in the absence 
of artificial disruption. 

Materials —Two Hfr strains are employed, both ultimately derived from the 
F+ strain, 58-161. W2323 is a subculture of the Hfr strain isolated by Hayes;? 
CS101 isa V,’ V.’ derivative of W1895, the Hfr strain originally isolated by Cavalli.’ 
CS2 is an F— derivative of 58-161. .W2323 and CS2 share the following genotype: 
M— T+ L+ Th+ Lac, + Mal, + Vi' V2' Ve'; CS101 differs only in being V{’ V2’. 
CS100 is an F— strain derived from Y53 and is of the following genotype: 1+ 
T— L— Th— Lacy— Mal,— V;° V2 V,’. The origin of 58-161 and Y53 is de- 
scribed elsewhere.’ The designations M—, 7—, L—, and Th— refer to methio- 
nine, threonine, leucine, and thiamine growth requirements, respectively, Lac— 
and Mal— to inabilities to ferment lactose and maltose, and V1’, V2’, and V,’ to re- 
sistance to coliphages T1 (and T5), T2, and T6. Wild-type T1, T2H, and T6 were 
employed; T2 and T6 were inactivated by a dose of ultraviolet light which produced 
about 50 lethal hits per phage. 

Experimental Procedures and Results.—When preincubated mixtures of CS101 
(Hfr Lac+ Mal+) and CS100 (F— Lac— Mal—) are plated on complete media 
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which permit the differentiation of parental fermentative types (EMB-lactose or 
EMB-maltose'®), a high percentage of colonies are characteristically sectored. 
These colonies contain both fermenters and nonfermenters and resemble closely in 
appearance the segregating heterozygotes pictured by Lederberg.'!! Since these 
variegated colonies (Lacv and Malv) do not appear when preincubated mixtures of 
CS2 (F— Lac+ Mal+) and CS100 are similarly plated, they presumably reflect 
the mating process. 

In order to examine the kinetics of formation of those units which give rise to 
variegated colonies on plating, the following standard procedures were adopted, 
which also met the demands of concurrent radiochemical studies.'? Parental 
cultures, grown for 12 hours in Difco Penassay broth at 37° C. without aeration, 
were washed and resuspended in peptone broth (1 per cent Difeo Bacto-Peptone, 
0.3 per cent NaCl, 0.1 per cent glucose, 10~* M MgSO,, 10~-* W CaCl.) at one- 
twentieth of their original concentration. After aeration at 37° C. for 2'/, hours 
(to give about 2 X 108 cells/ml), the cultures were centrifuged, washed, and resus- 
pended at a twofold dilution in Penassay broth. After further aeration at 37° C. for 
30 minutes, the cultures were centrifuged and concentrated fivefold in fresh Penas- 
say broth (to give about 10° cells/ml). At this time, the cultures were mixed, | 
part Hfr to 2 parts F—, and aerated at 37° C. Under these conditions, the total 
cell titer increased about twofold during 80 minutes of mixed growth. After 
varying intervals, samples were removed from the mixture and either were directly 
diluted in saline and plated to give isolated colonies on EMB-lactose or EMB- 
maltose or were exposed to UV-inactivated T2 or T6 prior to dilution and plating. 
Phage exposure consisted of 5 minutes’ adsorption in buffer at a multiplicity of 20 
phage/cell and at a cell concentration of 2 & 108 per milliliter. The numbers of 
“4? (fermenters), ‘‘—’’ (nonfermenters), and variegated colonies were determined 
after 18-24 hours’ incubation at 37° C. Mutant “‘+”’ papillae appeared later on 
“—”’ colonies and were easily distinguished from the more sectorial variegation. 

At no time did variegated colonies (Lacv or Malv) appear following exposure to 
T2 (which attacks the /— parent), as is to be expected if gene transfer is unilateral 
from Hfrto F—. 

Figure 1 shows the results of experiments in which measurements were made 
simultaneously of the frequencies of Lacv and Maly colonies before and after ex- 
posure to T6 (which attacks the Hfr parent) at different times during the mating 
period. Each point is based upon a sample containing 10°-104 Hfr cells as pooled 
from three separate experiments. The total number of Hfr cells at each time 
was taken as the number of Mal+ and Malv colonies which appeared when cells 
were not exposed to T6 before plating. The results are consistent with the follow- 
ing interpretation. 

Conjugating pairs yield variegated colonies (Lacv and Malv) when plated without 
exposure to T6. Exposure to T6 kills the Hfr parent and thus destroys the ca- 
pacity of a pair to produce a variegated colony unless transfer of the pertinent gene 
to the F — cell has already occurred and will be followed by incorporation. In the 
latter event, since the surviving /— cell is multinucleate, it is still capable of pro- 
ducing a variegated colony. Therefore, Lac+ is transferred and incorporated more 
frequently than Mal+, since Lacv colonies suffer much less reduction than Malv 
colonies following exposure to T6. In fact, the number of Malv colonies after T6 
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exposure is so small that the number of Malv colonies which appear when cells are 
not exposed to T6 may be taken as a measure of plated pairs. The excess of Lacv 
over Malv observed when cells are not exposed to T6 is ascribable to fertilized F — 
cells which have separated from their Hfr partners prior to plating. These “‘spon- 
taneous’’ exconjugants appear as early as 10 minutes after mixing the parental 
strains and increase throughout the period studied. 

On the preceding interpretation, exposure to T6 removes Hfr parental cells and 
leaves unfertilized F— cells together with an unselected array of fertilized cells 
The latter are sufficiently numerous that their derivative clones may be analyzed 
by replica-plating.'* Accordingly, mixtures of CS101 and CS100, after exposure 
to T6 at various intervals during the mating period, were plated on EMB-lactose 
(nonselective medium) as before. Then, after scoring for Lacy, all colonies (Lac— 
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MINUTES AFTER MIXING PARENTAL CULTURES 
Fic. 1—CS101 (Hfr Lac+ Mal+ V.*) XK CS100 (F— Lac— 
Mal— Vet). The frequencies of variegated colonies (Lacv and 
Malv) appearing on EMB-lactose or EMB-maltose when samples of 
the parental mixture were plated at different times. The curves 
designated as Lacv (+T6) and Malv (+T6) were obtained when 
mixtures were exposed to T6 before plating. 
as well as Lacv) were examined for the presence of the Hfr genes, V;, and L+. Vj,’ 
was detected on EMB-lactose spread with about 10° T1; Z+ was detected on 
minimal medium (EMS-lactose") supplemented with methionine, threonine, and 
thiamine (at concentrations of 10, 20, and 0.01 mg/I, respectively). The replica 
plates were scored after 18-24 hours’ incubation at 37° C. 

Colonies containing all seven possible combinations of the three Hfr genes scored 
(Lac+, V1’, and L+) were observed. Where two or three were present in a single 
clone, they were not invariably carried by the same subclone. Among those re- 
combinant clones containing at least one of the three Hfr genes, three types pre- 
dominated (94-99 per cent): Those which contained Lac+ but not V;’ or L+, 
those which contained Lac+ and V,’ but not 1+, and those containing all three. 
As indicated in Figure 2 (solid symbols), the relative frequencies of these three 
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main types vary with the duration of mating prior to exposure to T6 and plating. 
This confirms the general conclusion of Wollman and Jacob that not all Hfr genes 


are transferred simultaneously. 
proceeds in a preferred sequence: 


In addition, it provides evidence that transfer 
Lac first, then V,, and finally L. 


The fact that this sequence of transfer is exactly opposite to the sequence sug- 
gested by the selected recombinants of Wollman and Jacob® requires explanation. 


The two sets of experiments differ in four important respects: 
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20 
MINUTES AFTER MIXING PARENTAL CULTURES 


Fic. 2.—CS101 X CS100. The frequencies of different 
types of F — exconjugant clones obtained when samples 
of the parental mixture were plated at different times on 
nonselective medium. Each clone was scored for the 
presence of three Hfr genes: Lac+, V,', and L+. 
“Total recombinant clones’? is taken as the number 
which contained at least one of these genes. Solid sym- 
bols refer to clones obtained after T6 screening; open 
symbols refer to spontaneous exconjugant clones. At 
10, 15, and 50 minutes, 50-100 clones were examined, 
at other times 100-300. 


of the Lac, 
of transfer. 


the method of 
interrupting mating, the 
method of selecting recombi- 
nants, the Hfr strains used, and 
the lysogenicity of the strains 
used (Wollman and Jacob used 
nonlysogenic strains; all strains 
employed here are lysogenic 
for the temperate phage lamb- 
da). ‘To determine the source 
of discrepancy in results, ex- 
periments were performed 
which used the T6-screening 
method described here and the 
lysogenic progenitor (W2323) 
of the Hfr strain employed by 
Wollman and Jacob. The re- 
sults summarized in Table 1, 
although not extensive, appear 
to verify the temporal trend in 
gene frequencies reported by 
Wollman and Jacob. In partic- 
ular, the fact that 
containing Lac+ alone are al- 
Ways in a small minority, in 


colonies 


contrast to the results obtained 
with CS101 (Fig. 2), indicates 
that differences between the 
Hfr strains (other than lyso- 
genicity) are sufficient to ex- 
plain the different sequences of 
transfer which have been ob- 
served. These differences may 
consist of structural inversions 


L region with respect to some point which is involved in the initiation 
Jacob and Wollman' have discussed the possible significance of such 


structural differences in determining frequency of recombination. 


A reasonable model for the mating process has been afforded by observations on 


Het strain crosses (which yield persistent aneuploids).'! & 
plete Hfr (or F+) chromosome is transferred to a fertilized F— cell. 
then occurs at one or more specific points on this transferred chromosome. 


On this model, a com- 
Breakage 
Reduc- 
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tion division follows rapidly, so that Hfr genes are lost unless crossovers have 
incorporated them into the complete /— chromosome. This explains the general 
predominance of /— markers in K12 crosses. The generality of this model has 
been questioned for some time,'® but the experiments of Wollman and Jacob and 
ourselves provide the first strongly contrary evidence. This evidence consists of 
the fact that, to explain the temporal changes in gene frequencies, varying points of 
breakage of the Hfr chromosome must be assumed. If that part of the model 
based on Het crosses which calls for specific breakage points is not general, the 
generality of the part which calls for complete transfer of the Hfr genome is also 
open to question. The simpler alternative view that gene transfer may be nor- 
mally partial becomes an economical working hypothesis. 


TABLE 1 


W2323 (Hfr Lac + L + Ve") X CS100 (F — Lae— L— V4‘). Analysis of F-exconjugant clones 
containing either Lac+ or L+, obtained by exposure of the parental mixture to T6 after vary- 
ing intervals of growth. 


Duration oF Mrtxep GrowtH (MINUTES) 
CLoNnEs WHICH CONTAINED 15 30 45 60 


Lac+ only 
Lac+ and L4 
L+ only 
Total clones examined 


However, in both the present experiments and those of Wollman and Jacob, 
exconjugants were obtained by artificial methods (exposure to phage or shearing 
in a blendor) which might induce abnormally partial transfer.6. Therefore, an 
analysis was made of those exconjugants which arise spontaneously in a mating 
population of CS101 and CS100. This was done by plating samples at intervals 
directly onto EMB-maltose, where / — exconjugants and unmated F— cells form 
Mal— colonies. All colonies were then replica-plated to EMB-lactose, EMB- 
lactose plus Tl, and EMS-lactose supplemented with methionine, threonine, and 
thiamine. Those Mal— colonies were then examined which contained one or more 
of the three Hfr genes, Lac+, V1’, and L+ (i.e., demonstrable exconjugants). The 
relative frequencies of combinations of Hfr genes among these exconjugant clones, 
plotted as open symbols in Figure 2, show that the same trend occurs whether or 
not mating is deliberately interrupted with phage. Therefore, discontinuities 
arise at varying sites in the fr genome under conditions corresponding closely to 
those prevailing in any bacterial cross, suggesting the normal transfer of partial 


genomes. In fact, the close agreement of trends among spontaneous and T6- 
induced exconjugants (Fig. 2) suggests that phage (and perhaps blending as well) 
may play no role in the induction of partial transfer but instead may screen for 


spontaneous exconjugants by rendering conjugating pairs incapable of producing 
recombinants. 

Summary.—N onsimultaneous gene transfer from Hfr to F— conjugating cells in 
FE. coli K12 is confirmed. This transfer is oriented; certain genes regularly are 
transferred earlier than others. The sequence of transfer depends upon the Hfr 
strain employed, possibly due to structural differences. Sequential transfer is 
demonstrable among spontaneous exconjugants, strengthening the view that, nor- 
mally, incomplete genomes are transferred. 
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A PROPERTY OF SPECIAL JORDAN ALGEBRAS 
By A. A. ALBERT 
oineniiemcaies tial eiieneeig 
Communicated June 29, 1956 


Let Mt be the set of all linear transformations on a finite-dimensional vector 
space over a field § of characteristic not two. Then 9 is an associative algebra 
with respect to the operation ab of the ordinary product of linear transformations 
aand b. But 9 also forms a Jordan algebra Yt with respect to the operation 
a:b = '/,(ab + ba). 

At a recent Conference on Linear Algebras sponsored by the National Academy 
of Sciences—National Research Council, Nathan Jacobson conjectured that every 
Jordan algebra 8 is the homomorphic image of a special Jordan algebra %X, that is, 
the image of a subalgebra of a Jordan algebra 9'*’. In the present note we shall 
disprove the conjecture by showing that the known simple exceptional Jordan 
algebra is not such an image. Indeed, we shall derive the following more general 
result. 

THEOREM. Let 8 be a Jordan algebra with radical such that B is the homo- 
morphic image of a special Jordan algebra A. Then B — isa special Jordan algebra. 

For % is a Jordan subalgebra of an Nt'*’ and has an enveloping associative sub- 
algebra A* of M. Let N be the radical of Y%*, and form the semisimple associative 
image algebra (*)’ = %* — XM. The elements of (2*)’ are the cosets x’ = x + N 
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defined for all elements x in %*, and the mapping S, defined by 

rect, 
is the natural homomorphism of %* onto A* — N. Then S maps the special Jordan 
algebra % onto a special Jordan subalgebra Y’ of [(2(*)’]'* consisting of the cosets 
a+ Nforain %. It should be clear that 


(2’)* (*)’ 


is the enveloping associative algebra of %’. It is known! that the radical of 4 
is contained in the radical of %*. Similarly, the radical of YA’ is contained in the 
radical of (%*)’, that is, Y’ is a semisimple special Jordan algebra. 

We are assuming that 8 is the homomorphic image of Y{. Thus we may assume 
that 8 = A — GC, where C€ is an ideal of Y%. Let C’ be the subalgebra of Yt’ consist- 
ing of the cosets c + Nforcin €. Then @’ is an ideal of YX’. Define 


BY’ = A’ — C’. 


The elements of B’ are cosets a’ + @’ for a’ in A’. It is easy to verify that the 
mapping 


a+ Gra + 


is a homomorphism 7’ of B onto B’. 

The kernel & of T is the set of all elements k + € of 8 such that k’ + €’ = @’, 
that is, such that k’ is in ©’. The elements of ©’ are the cosets c + Jt, where c 
isin & Now k’ = k + MN with k in A, and we have k + € = h + GC, where h 
isin 3. Thus k’ is nilpotent, and & is a nilideal. It follows that 8’ is isomorphic 
to B — R, where R is contained in the radical of 8B. However, 8’ is the homo- 
morphic image of a semisimple Jordan algebra YX’ and so is semisimple. Hence 
R must be the radical H of B, and B — His isomorphic to B’. But Y’ is semisimple 


‘7e4 


and @’ is an ideal of %’, so that 9%’ = ©’ @® D’, where, then, D’ and B’ are iso- 
~ ’ , 


morphic. Hence %’ is isomorphic to the subalgebra D’ of a special Jordan algebra 


1’ and is special. 


* This paper was sponsored in part by the Office of Ordnance Research, United States Army, 
under Contract No. DA-11-022-ORD-1571. 

‘Cf. Theorem 10 on p. 534 of the author’s paper “On Jordan Algebras of Linear Transfor- 
mations,’ 7'’rans. Am. Math. Soc., 59, 524-555, 1946. 





ON A CLASS OF FUNCTIONAL EQUATIONS 
MODULAR TYPE 


By RicHARD BELLMAN 
RAND CORPORATION, SANTA MONICA, CALIFORNIA 


Communicated by H. S. Vandiver, July 3, 1956 


Introduction.—The Voronoi functions, or generalized Bessel functions, 


V.(2, y) = i eT nee tl ds/8°, (1.1) 
70 

play an important role in analytic number theory in connection with lattice-point 

problems and other investigations involving zeta functions. First introduced by 

Voronoi in his classic paper on the Dirichlet divisor problem, they have been treated 

by Hardy' and also by Steen,? who considered the more general functions defined 

by the relation 


f V(a; ai, @2,..., a,)x*~! dx = T'(s + a)P(s + a)... 0(s + ay), (1.2) 
0 


a class of functions whose importance was recognized by Voronoi. Extensions of 
these functions in the directions of algebraic number fields and matric fields have 
been sketched by Bellman,* and the matrix functions have been discussed in some 
detail by Bochner.‘ The function arises in the research of Hecke and in the con- 
tinuation by Maass.° 

In a previous paper*® we sketched some extensions of the method used by Hardy 
to obtain a number of striking identities satisfied by series formed with these func- 
tions. In this paper our aim is to indicate how to form extensive classes of func- 
tions satisfying functional equations of modular type, using the function above 
and various extensions and generalizations within the rational field, algebraic 
number fields, matric fields, hypercomplex fields, and finite fields. 

A more detailed account will be presented subsequently. 

2. The Rational Field.—Consider the function defined by the series 


f(z, y;u,vjht)= dS V((e+ m)V t,, (y + n) V to)e2*imu time) (2.1) 
where 0 < x, y, u,v < land th, tg > 0. Using the representation in equation (1.1), 
this may be written as 


} (4+ m)?*sti+2ximu 
te, ys wot) = fo | Senet | 
0 m= —« 


e 
a 


8 


> —x(y+n)%te/s + 7 ds 


Applying the functional equation for the theta function in each of the brackets (in 
the previous paper*® this was applied to only one of the brackets), we obtain the 
functional relation 


626 
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¢ 2ni(ru +yv) 
f(x, y; 
V tite 
3. An Observation,—Let us note that no properties of the weight ds/s* have 
been used. Consequently, the same functional equation holds for any weight fune- 
tion dG(s) permitting the interchanges of summation and integration. 

In addition, contour integrals of the form f, e~**?~*’? dz/z* may be used, 
with different contours yielding different functions, and slightly different types of 
identities. 

4. Specialization.—If we allow z, y, u, and v to assume rational values, a num- 
ber of interesting identities involving sums over divisors in specified residue classes 
are obtained. 

This is a consequence of the relation 


V.(z, y) ati Tis pin z*~* V.(\ zy), (x ¥0). (4.1) 


Consider, in particular, the case where a = 1, y¥=2u= eo = 1/2.. Theo 
equation (2.2) yields, with 4t = (, 


= _ ft} + 2m) (1 + 2n) 


m, 7 


l : : (1 + 2m) (1 + 2n) , 
> v,( )i-" =» (42) 


t m,n df 


~ Rt | ee R 
» v.( 5 ) ace) = — = v,( " a(R), 
R=1 4 t R=1 tf 


a(R) = : (—j)*t* 
(1+2m)(1+2n) R 


where 


Similar identities arise from other sets of rational values for x, y, u, and ». 
5. Generalizations in the Rational Field. I.—Using the generalized Voronoi 
function in the form 


V (x1, Ti 4 eres » Ga, ..- » Ay) = 


ex ex k Is 
J opie | e7 rent... —raekt— rake it/am... ee py OSE (5.1) 
0 0 ‘ 


a 
#=18; 


cf. Steen? and Bellman,’ a relation analogous to that of equation (2.2) can be ob- 
tained for the function defined by the series 


m 


+2 V ((e oo m)V ty, «0 vp (Sete + mye+1) V tet oS a Cee os a.) e 
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where 0 < x;, u; < 1, ¢; > 0. These identities yield analogues of equation (4.3) 
for higher-order divisor functions. 

6. Generalizations in the Rational Field. II.—TYwo further lines of generaliza- 
tion are immediate. In place of the two-dimensional version of equation (5.1), 
we may employ integrals of the type 


e w(sixi?+ sore? + sisers? — r42/s182] dsyds» ( | ) 
ai, a2? ). 
0 0 8, Se 


utilizing all combinations of products, one at a time, two at a time, and so on. In 
place of forming sums such as that of expression (5.2), we can form sums of the type 


> Ve((Q((ai + m:), (te + me))]”, (Cy +n) Vier tmtee) §=—(6 


mi 


where Q(u, v) = tu? + 2tuv + tv? is a positive-definite quadratic form. Use of the 
multidimensional theta-function transformation will yield the desired functional 
equation. 

7. Algebraic Number Fields.—Proceeding in the manner made classical by the 
work of Hecke, it is easy to form the corresponding Voronoi functions in totally 
real fields. Thus, for example, in R(/2), we have, as the analogue of the function 
in equation (1.1), 


, tr(wx2s + ry?/s) ds\ds» 
V(x, y) ( - 
J 8 0 (s Ro" 


9 
, elioad 


where the integration is over the region defined by s; > + s2+/2, and x 
aV2,y = nt wvV2. 


8. Matric Fields.—Similarly, over matric fields, following the work of Siegel, 
we may define the function of two symmetric matrices X and Y, 


> - > —tr(erX2S rY¥2871 dS 
V(X, Y) = oracle (8.1) 
S>0 s\° 


where the integration is over the region where the symmetric matrix S is positive 
definite, dS = II ds 


ijy 
as 


and | S| is the determinant of S (cf. Bellman*® and 


cj Sn 
Bochner‘). 

9. Discussion.—In a like fashion, generalized Bessel functions can be defined 
over a number of hypercomplex fields for which theta functions exist. This is an 
interesting direction of research which we shall develop in future papers. 

10. Finite Fields—Kloostermann Sums.—An analogue cf the Voronoi function 
in finite fields is the Kloostermann sum 

p—l 
Kis,¢,.9) = > Ore (10.1) 
n=1 
with a variety of other analogues obtainable from the continuous forms mentioned 
above. Results corresponding to those sketched earlier* and to those above can 
be obtained for these functions. 


1G. H. Hardy, “Some Multiple Integrals,’’ Quart. J. Math., 39, 357-375, 1908 
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ON THE PRINCIPLE OF INVARIANT IMBEDDING AND 
PROPAGATION THROUGH INHOMOGENEOUS MEDIA 


By RicHARD BELLMAN AND RoBERT KALABA 


RAND CORPORATION, SANTA MONICA, CALIFORNIA 
Communicated by Lyman Spilzer, Jr., June 15, 1956 


1. Introduction.—In 1943, Ambarzumian' introduced a new approach to the 
study of atmospheric scattering problems. This technique was considerably 
extended by Chandrasekhar,? who utilized it to resolve a number of long-standing 
problems and gave it the name “principle of invariance.”’ 

In a series of papers, of which this is the first, we shall show that the full power 
of the method has not yet been realized. A number of problems of current interest, 
in scattering theory, in neutron diffusion, in the propagation of radio waves through 
stochastic media, as well as a variety of problems in other fields, may be treated 
by its use. 

In addition to the varied physical applications, quite a number of mathematical 
questions arise concerning the relation between this approach and the classical 
techniques based upon partial differential and integral equations. These questions 
will be discussed subsequently. 

In this note we are primarily interested in methodology. As an illustration of 
the general method, we shall discuss the scattering of light by an inhomogeneous 
plane medium of finite thickness. The corresponding problem for homogeneous 
media was treated by Ambarzumian and Chandrasekhar. 

2. Principle of Invariant Imbedding.—Although space limitations do not permit 
us to diseuss in any detail the close connections existing between the approach 


presented below and a number of fundamental techniques of classical analysis, 


we would like to mention some of these relations in a very brief fashion. 

To begin with, the method is closely related to the “point of regeneration”’ 
method used in the study of stochastic processes, particularly in the study of 
“branching processes”’ arising in the study of bacterial growth, electron cascades, 
and cosmic-ray cascades (cf. Bellman and Harris;* 8. Janossy*). Second, the 
invariance principle is intimately related to the principle of causality and hence to 
Huygens’ principle (cf. Hadamard®). Consequently, there is a close connection 
with the theory of semigroups (cf. Hille®). 
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Finally, we should like to point out that the method is a decision-free version of 
the functional-equation technique of dynamic programming (cf. the “principle of 
optimality” [Bellman’}). 

In his book? Chandrasekhar refers to an early use of the basic principle by 
Stokes.* 

Let us now state the 

PRINCIPLE OF INVARIANT IMBEDDING. Given a physical system, S, whose state 
at any time t is specified by a state vector, x, we consider a process which consists of a 
family of transformations applied to this state vector. 

Suitably enlarging the dimension of the original vector by means of additional com- 


ponents, the state vectors are made elements of a space which is mapped into itself by 


the family of transformations. In this way we obtain an invariant process, by imbedding 
the original process within the new family of processes. 

The functional equations governing the new process are the analytic expression of 
this invariance. 

3. Inhomogeneous Plane Media of Finite Thickness.—To illustrate the method, 
let us consider one of the most immediate applications, the multiple scattering of 
light by an inhomogeneous plane medium of finite thickness. The homogeneous 
version of this problem was treated by Ambarzumian and Chandrasekhar. 

In this case, the thickness of the slab will be the parameter describing the state 
of the process. It will be clear from what follows how the procedure may be 
modified to apply to the corresponding problems for spherical and cylindrical 
regions. 

Consider the region bounded by two infinite parallel planes, possessing for the 
sake of initial exposition the requisite homogeneity properties as far as two space 
dimensions are concerned, so that it suffices to consider the plane region bounded 
by two parallel lines at a distance x apart. 

In order to derive the equations governing the process, we shall, as usual, con- 
sider the area as consisting of a sequence of infinitesimally thin parallel zones, 
each possessing attenuation, scattering, and absorption parameters dependent 
upon the position of the zone: 


Ax 


A’ |A B 


T he distance x is measured from the fixed plane B to the variable plane A. 

Let parallel rays, the flux of radiation traversing unit length being 2S, fall upon 
the boundary line A. Denote by u the cosine of the angle formed by the direction 
of the incident rays and the inward-drawn normal to A, and by v the cosine of the 
angle between the outward-drawn normal and a particular scattered ray. We 
wish to determine Sr(v, u, x), the intensity of light reflected in the direction 
v. We assume that the ratio of scattering to absorption in the layer between x 
and x + Az is A(x)/(1 — A(a)). 

The quantity r(v, u, 2 + Ax) is the sum of five terms of order zero and one in 
Az. All terms of smaller order are ignored, Following Ambarzumian,! we obtain 
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Ax =) , A(a) Ar 
T " 


A(x) Ax f'! (a) 
+ r(w, u, 7) dw + Ax | 
2 v Jo 2 « 


1 rl 
+ N)ar [ r(w, u, 2) dw | r(v, 
0 0 


Letting Ar — 0, this becomes 


Or(v, u, x) | l ; 
+ + r(v, u, x) = 1 u, xr) dw 
Uu v ) ov 


r(v, u, x2 + Ax) r(v, u,v) ( 


“ » 


Ox 


Introducing 


and performing a slight rearrangement, we obtain 


1 | OR(v, u, x) 4 ee. R e | 4 R dw 
(Vv, U, x = + (Ww, U, x 
A(x) Or oa a i 2 Jo A w 
l 
t J 
2 


os dz 
R(v, z, x) (3.4) 


I £ dw [{'! dz 
+ } R(w, u, x) R(v, z, x) 
$ Jo w 0 g* 


If R(v, u, x) satisfies this equation, so also does R(u, v, x). Taking for granted 


the uniqueness of solution, we write 


l OR(v, u, x) 4 l 1 ‘) R a ! R dz x 
- (v x = 22 
d(2) or u Po Sai Baa * 2Jo he " 
l dw 
1+ R(w, u, x) ; 
2 0 Ww 


Sane eee en i 


The further substitution 


Fs _— 


yields 


~ 


Bie 


OR(v, u, x) r 
Our 


Coen: ee 


Since R(v,u,0) = 0, we obtain 


Zz 
Riv, u, x) = f e 1/MuFI/@—”) Ny) glu, y) o(v, y) dy. 
0 


=a 
ieee 


ae 
i 
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It follows that $(u, x) satisfies the equation 


ess, 7 dv 
o(u,x) = 1+ = lf e Ulett /ye~y) A(y) O(a, y) Or, y) iy | . (3.9) 
zJ0 0 v 


This leads to a feasible computational solution which we shall discuss in further 
detail subsequently. 


''V. A. Ambarzumian, “On the Scattering of Light by a Diffuse Medium,’’ Compt. rend. Doklady 
Acad. sci. U.R.S.S., 38, 257, 1943. 

2S. Chandrasekhar, Radiative Transfer (London: Oxford University Press, 1950). 

*R. Bellman and T. E. Harris, “On Age-dependent Binary Branching Processes,’’ Ann. Math., 
55, 280-295, 1952; “‘On Age-dependent Branching Processes,’’ these PROCEEDINGS, 34, 601-604, 
1948. 

'S. Janossy, Cosmic Rays (London: Oxford University Press, 1950). 

° J. Hadamard, “Le Principe de Huygens,’’ Bull. Soc. Math. France, 52, 610-640, 1924. 

°K. Hille, “Functional Analysis and Semi-groups,’’ Am. Math. Soc., 1948 

7R. Bellman, “The Theory of Dynamic Programming,’’ Bull. Am. Math. Soc., 60, 503-516, 
1954. 

8G. G. Stokes, Mathematical and Physical Papers, Cambridge, 1904, Vol. IV: On the Intensity of 
the Light Reflected from or Transmitted through a Pile of Plates, pp. 145-156. Stokes’ method 
was rediscovered independently by R. K. Luneberg (The Propagation of Electromagnetic Plane 
Waves in Plane Parallel Layers {Research Report No. 172-3, June, 1947, New York University, 
Washington Square College]), and R. M. Redheffer (‘‘Novel Uses of Functional Equations,’’ ./. 
Rat. Mech. Anal., 3, No. 2, 271-279, 1954). 


ON MULTIPLICATIVE IDEMPOTENTS OF A 
POTENT SEMIRING* 


By SAMUEL BouRNE 
UNIVERSITY OF CALIFORNIA, BERKELEY, CALIFORNIA 
Communicated by H. S. Vandiver, June 26, 1956 
Bs INTRODUCTION 


A semiring is a system consisting of a set S and two binary operations in S 
called addition and multiplication such that (a) S together with addition is a 
semigroup; (b) S together with multiplication is a semigroup; (c) the left- and 
right-hand distributive laws a(b + c) = ab + acand (b + c)a = ba + ca hold. 

Semigroup is used in the sense of a closed associative system. R. Brauer men- 
tioned to Vandiver! that Dedekind concerned himself with such an algebraic sys- 
tem without giving the semiring a formal definition. The latter was first intro- 
duced by Vandiver.? The simplest example of a semiring is the system of count- 
ing numbers relative to ordinary addition and multiplication. Vandiver also gave 
an example! of a semiring which cannot be imbedded in a ring. Another sig- 
nificant example is the semiring of finite and infinite cardinal numbers* which 
cannot be imbedded in a ring. Independently, this system came to us‘ naturally 


as the totality of endomorphisms € of an arbitrary additive semigroup ©. A 


nonimbeddable semiring arises naturally when we take the totality of endomor- 
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phisms € of a nonimbeddable additive semigroup. If both semigroups of a semi- 
ring are commutative, we say that the semiring is commutative. An important 
example of a commutative semiring is a distributive lattice.6 In particular, the 
set of ideals of a commutative ring forms a commutative semiring relative to the 
operations sum and product.’ 

Our main purpose in this paper is to obtain an analogue for semirings of the 
fundamental lemma of structure theory in rings, that, if the left ideal { of a ring 
® with minimum condition is non-nilpotent, then { contains a multiplicative idem- 
potent.’ Clifford’ proved that in a semigroup © without nilpotent ideals ~ (0) 
and in which every two-sided ideal contains at least one left and at least one right 
minimal ideal of S, every left ideal L # (0) of S contains an idempotent element 
(0). We now show that this theorem remains true for semirings. 

This paper has benefited materially from discussion and correspondence with 
A. H. Clifford, of Tulane University, and H. Zassenhaus, of McGill University. 


2. ADDITIVE AND MULTIPLICATIVE IDEMPOTENTS 


Definition 1: An additive identity, called zero, is an element 0, such that 0 + 
s+ 0 = sforallsinS. 

Definition 2: An additive idempotent of a semiring S is an element a such that 
a + a = aanda multiplicative idempotent is an element m such that m? = m, 
m # 0. 

A. distributive lattice is an example’ of a commutative semiring in which each 
element is both an additive and a multiplicative idempotent. 

If S possesses a zero, then a0 = a(0 + 0) = 2a0 and 0a = 20a are additive idem- 
potents which are not necessarily equal. 

Example 1: Let S. be the noncommutative additive semigroup of order 2 with 
elements 0, 1 whose additive table is 


+ J 


oe J 
l eg 


Let & be the totality of endomorphisms of S. Then € = }0, J, A, B}, where 


(0, 1)0 = (0, 0), (0, IZ = (0,1), O, DA = (1, 1), and (0, LI)B = 0,0). The 
addition and multiplication tables in the semiring ©, are, respectively, 


+ 0 f A.B . Of, Ae 
0 0 A 2B 0 ® 0 A. A 
I 0 A B I O fA 8 
A 0 iB A O A Ax: O 
B 0 iB B 0 BAT 


Since 0A = A # 0 = AO, we have two distinct additive idempotents. 
Example 2: Let S, be the semiring of order 2 with elements 0, 1 whose addition 
and multiplication tables are, respectively, 
+ 0 | 


0 l 
I | 
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In this example S. possesses a zero element 0 in which 0? # 0. 

We shall assume that S possesses a zero element 0 and Os = sO = 0, forall sin S. 

Definition 3: A division semiring is a semiring in which the elements #0 form a 
multiplicative group. 


3. TWO-SIDED IDEALS CONTAINING MINIMAL RIGHT IDEALS 

For the sake of completeness, we repeat the definition of right ideal given else- 
where. * 

Definition 4: A right ideal S is a subset FR of S containing zero such that if 7, and 
r, are in R, then r; + rz is in R, and if ris in R and s is any element of S, then rs is 
in R. 

Definition 5: The sum over a given set A of right ideals R; is the smallest right 
ideal containing all the given right ideals. It consists of all finite sums }orj, 
where each r; belongs to one of the right ideals R;. We denote this right ideal by 
S°R;. This addition is both commutative and associative in the widest sense. 
In the case addition, in the semiring S, is commutative, our definition of the 
sum of right ideals becomes identical with the standard definition in ring theory; 
hence R, + Rz is the set of sums 7; + 72 with 7; in Ry and ro in Re. 

Definition 6: The product Rik, of the right ideals R; and R, of a semiring S is 
the set of all finite sums > rites with 7;,; in R, and ro; in Re. 

This multiplication of ideals is associative but not necessarily commutative. 


Also, the general distributive law >R;3°Q; = SR,Q, holds. 
t.7) 


Let us mention the fact that the intersection of any number of ideals of the 
same kind is again an ideal of the same kind. 


Definition 7: A right ideal N is said to be nilpotent if a power N” is equal to the 


zero ideal (0). 

Definition 8: A. semiring S is said to be potent if it contains no nonzero nilpotent 
right ideals and left ideals. 

LemMaA 1. Jf R ts a minimal right ideal of a semiring S, then tR, where tis in S, 
is either a minimal right ideal S or tk = (OQ). 

Proof: tR is a right ideal. Let A be a right ideal contained in tR # (0) and 
R, the set of all elements r; of R such that tr, isin A. Since t(ry, + 7.) = tre + 
tr, and t(ry,)S = t(m,)S are in A, we have that ry, + 7, and ry are in R,. 
This implies that R; ~ (0) is a right ideal contained in R. Therefore, Ri = R, 
tR; = tR©& A,and A = tR. This proves that ¢R is minimal. 

THEOREM 1. Jf I is a two-sided ideal containing a minimal right ideal of S, the sum 
J of all minimal right ideals of S contained in I is a two-sided ideal of S. If I is 
minimal, then it ts a sum of minimal right ideals of S. 

Proof: Since J is the sum of right ideals R;, it is a right ideal. We show that it 
is also a left ideal. Let j «J ands eS; then sj = s)cor; = Dosr;, where r; belongs 
to some minimal right ideal R; contained inJ. Now sr; esR;, and sR; is, by Lemma 
1, a minimal right ideal contained in /, or sR; = (0). Hence, in either case, s7 € J 
and J is a left ideal. Obviously, if J is minimal, J = J. 

THEOREM 2. If M is a minimal two-sided ideal of a potent semiring S, containing 
a minimal right ideal of S, then every minimal right ideal of S contained in M is a 


minimal right ideal of M. 
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Proof: Let R # (0) bea minimal right ideal of S contained in M, and A a right 
ideal #(0) of M contained in R. Now AM is a right ideal of S, and AM ¢ RM 
c Rk. Since RF is minimal, this implies that R = AM ¢ A and R = Aor AM = 
(0); then AS ¢ RS c R, and either AS = (0) or AS = 







(0). Suppose that AM 
R. Now AS = (0) says that A is a right ideal of Sand A = R. If AS = R, 
then R? ¢ ASAS ¢ ASMS ¢ AM = (0), which contradicts the potency of S. 
Hence AM cannot be (0). It follows that R = A; hence we have shown that a 
minimal right ideal of S contained in M is a minimal right ideal of M. 

However, it is unlikely that the hypothesis of Theorem 2 will imply that every 
right ideal of M is a right ideal of S. As yet we have not found a counterexample. 

If we are willing to restrict ourselves to semirings in which the lattice of right 
The modu- 









ideals is modular, then every right ideal of M is a right ideal of S. 
larity of the lattice of right ideals implies that this lattice satisfies the exchange 





lemma.!! 
THEeorEM 3. If Sis a potent semiring in which the lattice of right ideals is modular 


and M is a minimal two-sided ideal containing a minimal right ideal of S, then every 






right ideal of M is a right ideal of S. 

Proof: We show that 2M = xS for x eM. By Theorem 1, M - > Ri where 
R, « A, the set of minimal right ideals of S contained in M. Therefore, 2 = x2, + 
Xe + + x,, n finite, and R;e A. cS cmS +arS 4+ ... + 2,8 ¢ Ri + Rp 
-+- + R,. By the exchange lemma, we can renumber Rj, Ro, ..., R, so that 
Ri+Rz+...+R, =Rit+... +R + Rit... + Ra, where Ri’ + R,’ 
+...+R,' = 2xS,holds. We replace R,’ by R,, fori = 1,2,...,7, so that cS = 
Ri +R. +...+ R;. Since S is potent, we have (0) ¢ R? ¢ R,;. The mini- 
mality of R; implies R;? = Rk; Hence xrS > rM 3 aSM = (Ri + Ro +... 4+ 
Rj)M>R?P+R2+...+ Ri +Reo+...+R;, = 278. WehavezrM = 
xs. 

Let R be a right ideal of M. Then RS 

We give an example of a potent semiring which is not modular. 

Let S, be the semiring of order 4 with elements 0, a, b, ¢ whose addi- 









RM c R, and R isa right ideal of S. 









Example 3: 
tion and multiplication tables are, respectively, 







0 
0 
0 
0 
0 












Then R = {0, a}, Ri - 10, b,c}, Re = 30, b| are right ideals which fail to satisfy 


the modular identity, i.e., Ri n (2+ R) 4 Ro + Ron R. 








POTENT SEMIRINGS IN WHICH TWO-SIDED IDEALS CONTAIN A MINIMAL 
AND A MINIMAL LEFT IDEAL 





RIGHT IDEAL 





LemMa 2. Jf M is a minimal two-sided ideal of a potent semiring S and if M con- 
tains a minimal right ideal and a minimal left ideal of S and if R is a minimal right 
ideal of S contained in M, then there exists a minimal left ideal L of S contained in M 


such that LR = M and RL # (0). 
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Proof: Since the right ideal RM c R,RM = Ror RM = (0). If RM = (0), 
then R? c RM = (0), which contradicts the potency of S. We have RM = R. 

Since the two-sided ideal MR c M, MR = M or MR = (0). As before, the 
potency of S excludes MR = (0). We have MR = M. 

Theorem 1 implies that M is the sum of all minimal left ideals Z of S which are 
contained in M. If LR = (0), for all such L of S, then also MR = (0), contra- 
dicting MR = M. Thus there exists a minimal left ideal L of S contained in M 
such that LR ~ (0). Since the two-sided ideal LR c M, LR = M. If RL = 
(0), then R = RM = RLR = (0). Hence RL # (0). 

In the following sequence of lemmas we let M be a minimal two-sided ideal of a 
potent semiring S which contains a minimal right ideal and a minimal left ideal of S, 
and R be a minimal right ideal and L be a minimal left ideal contained in M, such that 
LR = M and RL = (0). Then 

LemMa 3. The equations sx = 82 and ys, = 8 are solvable in RL, where s; ¥ 0 
isin R n Land sis in RL. 

Proof: Since s; is in L, Ms, is a left ideal of S contained in M. Therefore, Ms; = 
Lor Ms, = (0). If Ms, = (0), s: ¥ 0, then the set X of all elements x of M for 
which Mx = (0) contains an element #0. Moreover, X is a two-sided ideal of S. 
X is evidently closed under addition. Now (0) = Ma > (MS)x = M(Sz). This 
implies that Sx ¢ X. Similarly, M(aS) = (Mx)S = (0) and xS ¢ X. Thus 
X2 ¢ MX = (0) and X = (0) ands: = 0. Wehave Mg, = L. Since s, is in R, 
8 FR is a right ideal of S contained in R. Therefore, 3,2 = R or 3k = (0). If 
sk = (0), then M = LR = Ms,R = (0). Wehave sk = R. Finally, RL 


RL, and the equation s;7 = s2 is solvable in RL. In exactly the same manner we 


prove that the equation ys; = s is solvable in RL. 
Lemma 4. RL is a division semiring. 
Lemma 5. L = Se, R = eS, and R n L = eSe, where e is the identity of the di- 


vision semiring. 

Lemma 6. Roa L = RL. 

Lemma 7. The identity element e of the division semiring RL is a primitive idem- 
potent. 

The proofs of Lemmas 4-7 are straightforward and analogous to those given by 
Clifford,’ and they are therefore omitted. 

If each two-sided ideal 4 (0) in a potent semiring contains a minimal right ideal and 
a minimal left ideal of S, then 

LemMa 8. Each two-sided ideal 4 (0) of S contains a minimal two-sided ideal of S 
(the smallest two-sided ideal containing a minimal right ideal R of S that is contained 
in the given two-sided ideal). 

Proof: Let A be a two-sided ideal of S, and FR a minimal right ideal of S which is 
contained in A. Then SR is a two-sided ideal of S contained in A. Since R? = 
R # (0), then SR # (0). We prove that SR is minimal. Let B be a two-sided 
ideal #(0) such that B ¢ SR. Then RB is a right ideal of S such that (0) ¢ 
RB c¢R. Since R is a minimal right ideal of S, then either RB = R or RB = (0). 
If RB = (0), then SRB = (0) and B? = (0). The potency of S requires B = (0), 
which leads to a contradiction. Hence R = RB ¢ SB ¢ BandSkR c¢ SB c B. 
Previously, B ¢ SR. We have then B = SR. 

Lemma 9. Each minimal right ideal of S is contained in a minimal two-sided ideal 
of S. 
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Proof: Immediate consequence of the previous lemma. 

Lemma 10. Each right ideal #(0) of S contains a minimal right ideal of S. 

Proof: If R isa right ideal ~(0) of S, then SR # (0), because there are no right 
annihilators #(0) of S. By assumption, SR, a two-sided ideal of S, contains a 
minimal right ideal R; ¥ (0). As before, the set X of all elements x « SR which are 
right annihilators of SR is a two-sided ideal of S. Now X2 ¢ SRX = (0). The 
potency of S requires that XY = (0). This implies that SRR; ¥ (0) and RR, # (6). 
Therefore, there exists an element ¢ in R such that ¢R; ~ (0). By Lemma 1, tk; 
is a minimal right ideal ~ (0) of S contained in R. 

We are now ready to prove the main result of this paper. 

THeoreM 4. If S is a potent semiring in which each two-sided ideal contains a mini- 
mal right ideal and a minimal left ideal of S, then any right ideal R # (0) contains a 
multiplicative idempotent. 

Proof: Lemma 10 yields a minimal right ideal R, in R. This minimal right ideal 
is contained in a minimal two-sided ideal M of S, by Lemma 9. Lemma 2 gives a 
minimal left ideal L of S such that LR, M and RL # (0). Rol is a division 
semiring, by Lemma 4. Its multiplicative identity e ¥ 0 is the multiplicative idem- 
potent #0 contained in R. Lemma 5 states that Ry = eS. 

We conclude this paper with the following observations. Using Euler’s argu- 
ment, we can show that if S is a semiring and if R is a minimal right ideal of S con- 
sisting of finitely many elements, then either R? = (0) or R contains an idempotent 
element. Hence the fundamental lemma of structure theory in rings remains 
true in finite semirings with minimum condition. However, this-lemma does not 
remain true in any semiring with minimum condition. For, when we change the 
order of multiplication, the semigroup S given by Baer and Levi'* is an example of a 
semigroup with right division but containing no idempotent element. We imbed 
S into a semiring S as follows: S = 10, S t, with ss. as in SG if s; ¥ Oand s ¥ 0, 
8182 = O otherwise, s; + so = 82 if s% in S, and s; + s% = 8 if s = 0. Then Sisa 
potent semiring without an idempotent element and one in which there is no right 
ideal different from S and 0. The question whether the condition, in Theorem 4, 
that S be a potent semiring is necessary remains unanswered. 
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GENERALIZED RESOLUTIONS OF THE IDENTITY FOR CLOSED 
SYMMETRIC ORDINARY DIFFERENTIAL OPERATORS 
By Ear A. CoppInGron* 
MATHEMATICS INSTITUTE, UNIVERSITY OF COPENHAGEN, AND UNIVERSITY OF CALIFORNIA, 
LOS ANGELES 
Communicated by Einar Hille, June 7, 1956 

|. Jntroduction.-It is the purpose of this note to outline how any generalized 
resolution of the identity for a closed symmetric ordinary differential operator 
may be expressed in terms of a basis of solutions of a differential equation and a 
certain spectral matrix. This result implies a corresponding expansion theorem 
and isometry with a transform space. The proof includes the case of a self-adjoint 


operator, and the results reduce in this case to the known ones, except that, of 


course, the isometry involved in the self-adjoint case is a unitary mapping. 
We consider a formal ordinary differential operator 


L = poD* + pD*" + ... +m, 


where D = d/dz, the ~, are complex-valued functions of class C"~* on an open 
real interval a <2 <b(a = —& orb = +, or both, are allowed), and po(x) ¥ 0 
on (a, b). We also assume that Z is formally self-adjoint, i.e., L coincides with 


its Lagrange adjoint 
L+ = (—1)"Dhp) + (—1)"~'D" "by +... + pp. 


Let X be the Hilbert space of all complex-valued functions on (a, b) whose magni- 
tudes are square summable on (a, b). We denote by ® the set of all u € which 
have continuous derivatives up to order n — 1 on (a, b), u"~ is absolutely con- 
tinuous on every closed subinterval of (a, b), and Lu e #. Let T be the operator 
in H with domain 9, and Tu = Lu, u € D. 

For u, v « D we have Green’s formula, 


S% (alu — uly) = [uv](x) — [ur](y), 


where [uv](2) is a certain form in (u, wu’, , uD) and (pv, v’, 

follows from Green’s formula that [uwv](x) tends to a limit as x tends to a or b. 
These limits are denoted by [wv](a) and [uv](b), respectively, and we let (aw) = 
fuv](b) -- [uv](a). Therefore, (uw) = (Lu, v) — (u, Lv), where ( , ) is the 
inner product in #. Let Do be the set of all uw « D satisfying (uv) = O for all 
v e D, and define the operator 7) by prescribing its domain as Do, and Tyu = Lu, 
u¢€ Do. Then 7) is closed, symmetric, its adjoint 7)* is 7, and T* = Ty. Ina 
sense, 7’) is the smallest closed symmetric operator in 3C which is associated with 
L. If 7; is any closed symmetric operator in 5 with domain 9; which contains 
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all funetions in & which vanish outside closed bounded subintervals of (a, 6) and 
such that Tu Lu, u € D, then 75 © 7,.' 

We consider any generalized resolution of the identity F associated with the 
operator 7’). A generalized resolution of the identity F is a one-parameter family 
F iF(A), ~2 <A < +} of self-adjoint operators in 3C satisfying (a) F(A) — 
F(u) is positive bounded if \ > wu; (b) F(— ©) = 0, F() = J (the identity opera- 
tor); and (c) F(A + 0) = F(A). If the further restriction F(A) F(u) = Fv), 
v = min (A, yw), is imposed, then / becomes an ordinary resolution of the identity. 
The family F is associated with 7) if for any wu € Do, v € H, 


(Tou, v) = peal Ad(F(A)u, v), T oul}? S AM d(F(A)u, wu). 


The operator 7’) can always be extended to a self-adjoint operator H in a (possibly 
larger) Hilbert space 3(;, and each generalized resolution of the identity F is of the 
form F = PE, where E is a resolution of the identity for an appropriate H and P 
is the projection of 3(; onto 3. Conversely, if F = PE, where P and EF are as 
above, then F is a generalized resolution of the identity for 7'y.? 

Corresponding to any generalized resolution of the identity F for T> there is a 


unique generalized resolvent R(1) given by 


(R(Df, g) | dFOS, 9) (f,geX; 91 0). (1) 
Js iphiag Jy 
Using a characterization of generalized resolvents due to A. V. Straus,’ we indicate 
how the range of R(l) can be prescribed by certain boundary conditions involving 
the parameter /. Using this, and the existence of a right inverse for 7 — 1, which 
we obtained earlier,‘ we show that R(/) is an integral operator of Carleman type, 
with a kernel having well-defined properties. By writing this kernel as the sum 
of a certain fundamental solution for (LZ — J)u = 0 and a second kernel, we then 
indicate that an inversion of equation (1) can be carried out to yield the expressions 
for the F(A). As indicated above, these expressions immediately imply an expan- 
sion result and Parseval equality. 

To our knowledge, the only results of this type which have been obtained for 
F which are not ordinary resolutions of the identity are those due to A. V. Straus® 
and ourselves. A. V. Straus considered the case L = —DpD + qon0 <2< @, 
where p and q are real and the limit-point case prevails at infinity. In this case 
‘only boundary conditions at the origin enter. He obtained the expressions for 
all F(A) in this situation. Our present method may be considered as the appro- 


priate extension of his. In the case of a maximal symmetric extension of 7’ 
(which always exists in 3C) there is a unique associated generalized resolution of 
the identity, and we have determined® the expressions for the F(A) by a quite 


different method. 

Detailed proofs will appear elsewhere. 

2. The Generalized Resolvent—Let F be an arbitrary, but fixed, generalized 
resolution of the identity associated with 7, and let R() be the corresponding 
generalized resolvent given by equation (1). Let &(/) denote the eigenspace of 
T for l, i.e., &(0) is the set of all we O such that Tu = lu. If wt = dim &(2), w= = 
’ dim &(—2), then it is known that dim &(/) = w+ if gl > 0 and dim &(l) = w7 if 
gl <0. According to A. V. Straus,? every R(/) has the form 
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Ril) (Tam — 0) (gl > 0), RQ) = (R(D)*. (2) 


Here A(/) is an operator taking &(—7) into &(2), which is analytic for gl > 0 and 
whose norm is less than or equal to one. The operator 7'4,) is the quasi-self- 
adjoint extension of 7’) generated by A(/), that is, the operator whose domain 
(1) is the set of all wu ¢ © having the form u = w + (J — A(l))u-, where wy € Do, 
u~ €&(—7), and J is the identity operator on &(—7), and which is defined by 7'4 q)u 
Tu (= Lu) for ue O(2). Conversely, given any such A(l), the operator (744) — 
!)~! for gl > O determines a generalized resolvent R(/) for 7) via equation (2). 

Let oy, , ¢,+ be an orthonormal basis for &(2), and let yy, , Wo- be one for 
&(—7). Define the functions v,(1) and v,*(1) by 


v(t) = ¥, — A(Dy,; (7 = 1, er) 
v,*(l) = o, — (A())*¢; (7 = 1,....;.w*). 


THeoreM 1. For gl > 0, the domain D(1) of T.4, ts the set of all u € D satisfying 
(uv;*(D) = 0,7 = 1, , wt, and the domain D*(1l) of (T'4q))* ts the set of all 
ue D satisfying (uv,(l)) = 0,7 = 1, ,w 

Now consider a right inverse of 7 — | of the type we constructed in a previous 
work.* This is an integral operator G(l) of Carleman type with bound |G(J)|| S 
gl\-! and a kernel G = G(z, y, 1) having the property that 0’**~*G/Ox’~'dy*~! 
(7,k = 1, , n) are continuous in (2, y, 1), and analytic in /, on any region for which 
gl ~ 0. An exception occurs at x = y when / or k is n, due to the well-defined 
jump there. For fe X,u = R(Df — G(Df « D and (T — lu = 0, which implies 
R(Df — Gf « &(). Using the additional fact that (R(Dfv,*()) = 0,7 = 1, 

, wt, we find that R(l)f may be written as 


ROS = GOS + ¥& XO SOS, WD) & (gl > 0), (3) 
j=1k=1 


where ¢,(/), , +(1) and ¥,(l), , w.-()) are bases for &(l) and &(1), respec- 
tively. From this representation we can deduce the following result. 

TueoreM 2. The generalized resolvent R(l) is an integral operator of Carleman 
type having a kernel K = K(x, y, 1) with the property that 0! +*~*K /dx’~'dy*—! (7, k = 
1, ...,”) are continuous in (x, y, 1), and analytic in l, on any region for which gl ¥ 0, 
except for x = y, when j or k is n. 

The analytic nature of the kernel follows by introducing analytic bases for 


s(l) and &(1). If glo > 0, then ¢,(I), ¥,(0) defined by 


o(L) 


T+ (lb — bk) GD] dell) (k = 
¥() = 1 +(—|h) 


GD Wy(h) (j = 


will be analytic bases for &(/), &(2), respectively, if |1 — lb! < |9lo|/2. Here 
(ly), w)(lo) represent any basis for &(l), &(o), respectively. Using these bases 
in equation (3), it can be shown that ®, is analytic for 1 — ly < gly /2. 

We now rewrite the kernel K in a different way, by letting K = Ky + Ay, where 
Ky is a certain fundamental solution for (LZ — lu = 0, which we describe. Let 
si(l), ..., 8,(l) be a basis for the solutions of (L — l)u = 0 satisfying s;*~(e, l) = 
5, for some c,a <c¢ <b. Then the s,“~"(a, l) are entire in / for any fixed x in 
(a, b). Also, [s,(L) s,(7)](x) is independent of x and I and will be denoted by S,,. 
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The matrix S (S,,) is nonsingular; let S GS denote its inverse. We 
define Ky by 


Ko(2, y, L) Koly, x, l l/s S, s(x, s(y,D) (y = 2). 


Using this, it follows that A, can be written : 


K,(2, Y, l) 2 W,;(l) s(x, Ll) s LY, l), 
‘i l 
and the following theorem obtains. 
THEOREM 3. The matrix V (W,,) ts analytic for Jl A 0, ¥*(L) = WD, and 
IV (l)/dl = O, where gv (YW — W*) /2z. 
The proof results from noting that 


ra) +h 


(IW) ,,.(1) (oe, €, DB), (4) 
Ox? ; 
where H(x, y, l) = (K(a, y, ) — K(a, y, 1))/27. The positivity of the matrix on 
the right-hand side of equation (4) for gl > 0 follows by applying the inequality 
(9(R(Df, f)/sl) 2 | R(Df |? to suitable functions f 
3. The Generalized Resolution of the Identity—From Theorem 3 we can assert 
that W(/) has the representation 
“7 M+ 1 " 
W(l) a+ 6l + | da(r (dl € 0), (5) 
. A-l 
where a, 8 are Hermitian constant matrices, 8 2 0, and o is a Hermitian matrix 
function of bounded variation which is nondecreasing, in the sense that (¢&, &) is 
monotone nondecreasing for any complex n-dimensional vector ~ This assertion 
follows by applying the known theorem’ for scalar functions to the form (Wé, &). 


From the representation (5) the limit 


a | 
p(A) lim | IV(v + te) dp 


e>+0 TIO 
exists for any fixed real \, and the matrix p is nondecreasing and of bounded varia- 
tion on every finite \-interval. Using the representation K = Ky + K;, and the 


inversion of equation (1) 


ic 
(F(A)f, g) lim / I(R(v + te)f, g) dv, 
T/A 


e> +0 


where A = (u, A] and A, w are continuity points of /, we arrive at the following 
representation for F’. 
Tueorem 4. If f € KH and vanishes outside a closed bounded subinterval of (a, 6) 


and X, uw are continuity points of p and F, then 
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n 


F(A)f(x) | y 8)(x, A) F(A) dpy(Ad), 
JA 


j,k l 


whe re bP (A ) = (f, Si (A )). 
ete =; . 25 Cy), n = (m, .7,) be vector functions of A, and introduce 


the inner product 


n 
(6,0) = D(A) Se(A) dod) 
—~o j,k=1 
and norm |¢) = (¢, )%. Let L°(p) be the Hilbert space of all ¢, measurable 
with respect to p, such that ||¢) << ©. Then an immediate consequence of Theorem 
4 is the following expansion result and Parseval equality. 
TueoreM 5. If f € 5, the vector f = (f,;) converges in norm in L*(p) and |\f\| = 
ff. In terms of this f, 


f(z) = dX sj(a, d) f(A) dp (A), 
-~o j,k=l 


where the integral converges in norm in KX. 


* The author is a Fulbright grantee. 

1 For the facts concerning 7’) and 7’ see E. A. Coddington, ‘‘The Spectral Representation of 
Ordinary Self-adjoint Differential Operators,’’ Ann. Math., 60, 192, 1954. 

2 For a discussion of generalized resolutions of the identity and generalized resolvents see, e.g., 
N. I. Achieser and I. M. Glasmann, Theorie der linearen Operatoren im Hilbert-Raum (Berlin, 1954) 
(German translation). 

3A. V. Straus, “Generalized Resolvents of Symmetric Operators,”’ Izvest. Akad. Nauk S.S.S.R., 
Ser. Mat., 18, 51, 1954. 

‘E. A. Coddington, ‘The Spectral Matrix and Green’s Function for Singular Self-adjoint 
Boundary Value Problems,’’ Can. J. Math., 6, 169, 1954. 

5A. V. Straus, ‘On Spectral Functions of Differential Operators,’’ Izvest. Akad. Nauk S.S.S.R., 
Ser. Mat., 19, 201, 1955. 

6 &. A. Coddington, ‘On Maximal Symmetric Ordinary Differential Operators,’’ Math. Scand. 
(to appear). 

7 See, e.g., M. H. Stone, Linear Transformations in Hilbert Space (New York, 1932), Theorem 
10.36, p. 573. 


CAUCHY’S PROBLEM FOR LINEAR DIFFERENTIAL EQUATIONS 
WITH CONSTANT COEFFICIENTS* 
By Leon EHRENPREIS 
INSTITUTE FOR ADVANCED STUDY 
Communicated by Deane Montgomery, June 27, 1956 

Global solutions for Cauchy’s problem in linear partial differential equations 
have been essentially restricted to the hyperbolic case. In the present note, we 
wish to show what the well-posed Cauchy problem is for linear equations with con- 
stant coefficients which may or may not be hyperbolic and which may even be para- 
bolic or irregular (see below). As a by-product of our investigations, we shall give 
a new proof of Garding’s theorem that Cauchy’s problem is well posed in the C* 
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theory if and only if the differential operator is hyperbolic.' Also, our methods 
are very amenable to explicit calculation of Green’s functions and the solution of 
Cauchy’s problem. 

In this note we shall restrict ourselves to the case where the initial hypersurface 
is a hyperplane. In forthcoming papers we shall extend the results of this note in 
many directions, including the replacing of Cauchy’s problem by more general types 
of boundary-value problems and the giving of boundary-value data to systems of 
equations which may even be degenerate.” 

Let us denote by R (C) real (complex) Euclidean space of dimension n + 1. We 
denote the co-ordinates in R (C) by t, 2, t2,..., <n. We also denote by R’ (C’) a 
real (complex) Euclidean space with co-ordinates s, 2), 22,...,2,. We shall some- 


times write z for (2, t,) and zfor (2), ..., Zs). 
By & we denote the space of C™ functions on R; & is the subspace of & consisting 


of those functions which are independent of ¢. We topologize & and & according 
to the method of L. Schwartz. &’ is the dual of &, E’ is the Fourier transform? of 
&’, and E is the dual of E’. £E,’ is the Fourier transform of the dual &’ of &, and 
E, is the dual of Ey’. We consider Ey’ and E’ as spaces of functions on C’; E’ con- 
sists of all entire functions of exponential type which are of polynomial growth 
on R’. The Fourier transform is defined in such a manner that Eo’ consists of 
those functions of E’ which are independent of s. 

By 5 we denote the space of entire functions on C with its usual topology,’ 3C’ is 
the dual of 3¢, H’ is the Fourier transform of 3c’, and H is the dual of H’. The 
spaces 3Co, Ho’, Ho’, Hy bear the same relation to 3, 5’, H’, H as the spaces &>, 
0’, Eo’, Ey do to the respective spaces &, &’, EB’, E. 

Now let D be a partial differential operator with constant coefficients. We 
assume first that D is regular (with respect to ¢); that is, we may write D = 0”/Ot™ + 
D,(o"~*/at™~") +... + Dp, where the D, are partial differential operators which 
are independent of ¢. Cauchy’s problem then consists of finding a function f of 
(t, x) which satisfies 


Df = g, 
o*f(z) 


at = 9,(x) fork = 0, 


where g is a given function of (¢, x) and the g, are given functions of z. 

1. Nonparabolic Equations.—We assume that g « K and that the g, are in Xo; 
we seek a solution f « H. We write D’ for the adjoint of D. Then equation (1) 
states that D’/W-f = W-Df = W-g for any W eX’. That is, on identifying 3 
with the dual of 3’ (which is permissible, since 5 is reflexive), f is prescribed on the 
subspace D3’ of 3’, namely, 

f-D'W = g-W. (3) 


Equation (2) tells us that, if 7 « K’ is of the form T = Ty + (0/0t)T; +... + 
(o"~*/ot™—")T,_1, Where T; € Ho’, then the value f- 7 is given by 


T= Dogue Tr. (4) 


Let us take the Fourier transform of conditions (3) and (4). We denote by P 
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the Fourier transform of D; hence P is a polynomial which we may write as P(s, 
n 1 ° - ‘. 

z) = 8s" + Py(z)s" +... + P» (2), where the P; are polynomials. We are looking 

fora U e H which satisfies 


U-PF = V-F (5) 


for all F « H’, where V ¢ Ais given, and, if G e H’ is of the form G(s, z) Go(z) + 
Gi(z)s +... + Gn_s(z)s™~ |, where G, € Ho’, then 


U-G > VaeG, (6) 
where the V;, are prescribed elements of Hp. 
The solution of our problem would be complete if we could show that every 
K ¢ H’ admits a unique decomposition in the form 


m l 


K= > Gwe’ + PF, (7) 


j=1 


where the G,; « Ho’ and F « H’ depend continuously on A; if this is the case, we say 
that D is nonparabolic (with respect to 4). The decomposition (7) is the analogue 
for the space H’ of the Euclidean algorithm. 

A simple argument shows that any K e H’ admits a decomposition in the form 
(7), where the G, and F are entire functions on C’ which are uniquely determined 
by A; we want to know when G; and F lie in H’. ‘To answer this, we note that the 
(7, are determined by the values of A on the algebraic manifold M defined by the 
equation P = 0 if P has no multiple polynomial factors, or else by the values of K 
on M together with certain derivatives of K on parts of M. We can easily see that 
there exists a polynomial P°(z) (the discriminant of P) such that, for each 7, P°G;, 
can be expressed in the form 


m 


P%(z) G,(z) > K(s,(z))Qj(s,(2)) (8) 
7 1 


if P has no multiple factors (with a similar expression if P has multiple factors, 
except that some of the derivatives of AK appear on the right-hand side of eq. 8). 
Here the Q; are polynomials in one variable, and the s;,(z) are the zeros of the equa- 
tion P(s, z) = 0. Now H’ is the space of all entire functions of exponential type. 
From equation (8) it follows that P°G, will be in Hp’ if, for all r, s,(z2) = O()2z!). 
From P°G, ¢ Ho’, we deduce® also that G; ¢ Hy’, and, from equation (7), also that 
F ¢ H’. The same argument also shows that the G; and F depend continuously 
on K in this case. 

Conversely, if we do not have s,(z) O(\z|) for all r, then we can construct a 
K ¢€H’ such that, for some 1, G, ¢ Hp’. 

The condition s,(z) = O() 2) ) for all r is easily seen to be equivalent to the condi- 
tion that m be the degree of the polynomial P. Thus we have 

THEOREM |. D is nonparabolic if and only if m is equal to the order of the differ- 
ential operator D. 

Whether D is parabolic or not, the set of AK of the form (7) with G, and F in H’ 
contains all polynomials and so is dense in A’. It follows from topological argu- 
ments that we can find g and g; such that Cauchy’s problem is not solvable if D is 


parabolic. 
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Turorem 2. Jf D is nonparabolic, then Cauchy's problem is well posed in %, 
that is, for any g € 5, gj € Ho, there exists a unique solution in 3K to equations (1) and 
(2) which, moreover, depends continuously on g and the g;. In the case that D is para- 
bolic, we can find ag € 3% and q; € Ko such that equations (1) and (2) have no solution 
in. Inany case, equations (1) and (2) have at most one solution in 3 for gj, g € KR. 

2. Hyperbolic Equations.—We are looking for those operators D for which we 
have an analogue of Theorem 2 above for & in place of 5. D is called hyperbolic 
(with respect to é) if, for each K ¢ E’, we have an expansion (7), where the G; and 
F lie in E’ and where the G’ and F depend continuously on K. We repeat the same 
type of argument as above for & instead of 50; we want to find those operators D 
for which, as in equation (8) above, 

P°(z)G ,(z) 2 K(s,(z)) Q)(s,(z)) (8’) 
r=1 
(with suitable modifications if P has multiple factors) lies in Eo’ whenever K e E’. 
It follows from the definition of the space E’ that this will be the case if, for all r, 
§(s,(z)) = OC + | S(z)|), where ¥ denotes the imaginary part. Again,* if P°G; € E’, 
then G, « E’ and hence also F ¢ EB’. 

On the other hand, if, for some r, 9(s,(z)) # O(L + | 9(z)| ), then we can construct 
aK ¢€ E’ andan/such that G,¢ E’. 

THEOREM 3. D is hyperbolic if and only if 9(s,(z)) = OU. + | 9(z)| ) forall r. 

Contrary to the case of the space H’ above, the set of functions K ¢ £’ which 
have an expansion (7) with G;, F « E’, may not be dense in E’. This set will be 
dense if and only if the hypersurface ¢ = 0 is not a characteristic? of D. 

TueoreM 4. Jf D is hyperbolic, then the Cauchy problem is well posed for BE. If 
D is not hyperbolic, then either t = 0 is a characteristic of D, or we can find g « & and 
q; € & such that equations (1) and (2) have no solution in &. 

Remark: It is not difficult to show that every hyperbolic operator is nonpara- 
bolie. 

3. Nonparabolic, Nonhyperbolic Operators.—Let 0 < a S 1 be given; we choose 
the space $, which lies between H and £; §, (which is a space of non-quasi-analytic 
functions) is so defined that the Fourier transform G,’ of its dual consists of all 


entire functions of exponential type F which satisfy F(s, z) = O(e***'#"*) Q(s, 2) 


for (s, z) e R’, for some polynomial Q. Then, by an analysis like that above, we 
can find those operators for which we have a well-posed Cauchy problem for the 
space 5,. 

t. Parabolic Operators.—In order to get a well-posed Cauchy problem, we have 
to consider spaces for which the Fourier transform of the dual is no longer a space 
of entire functions of exponential type. Examples are given by the spaces Q4 of 
entire functions of order SA and by many of the spaces which arise in the theory 
of infinite derivatives.* The important point to note is that, in the case of non- 
parabolic operators, the Cauchy data g, g, are restricted only from the point of 
regularity, while in the parabolic case the g and g, are restricted in regularity and 
growth at infinity. This theory is, of course, well known for the heat equation. 

5. Irregular Operators.—If D is irregular, i.e., D = Do(0”"/ot”) + Der" 
ot” 1) +... + Dy, where the D, are linear differential operators in x with constant 
coefficients, Dp # 0, 1, so that P = Pos” +... + P», where the P; are polynomials 
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in z with Py # 0, 1, then we cannot hope to get an expansion (7) in general. What 
has to be done is to add to the right-hand side of equation (7) some terms which de- 
pend on the polynomial Py. Thus the “initial surface” for the Cauchy data gets 
replaced by several surfaces. The method then proceeds as before. 


* Work supported by National Science Foundation Grant NSF 5-G1010 at the Institute for 
Advanced Study. 

'“Tinear Hyperbolic Equations with Constant Coefficients,’’ Acta Math., 85, 1-62, 1951. 
Actually, our result (Theorem 4) is more powerful than G&rding’s because he had to assume that 
the Cauchy problem is well posed for & in order to deduce that D is hypberbolic. 

2 L. Ehrenpreis, “Solutions of Some Problems of Division. I,’’ Am. J. Math., 76, 883-903, 
1954. 

> L. Ehrenpreis, “Solution of Some Problems of Division. III,’’ ibid. (to appear), and “Theory 
of Infinite Derivatives’’ (to appear). 

* L. Hérmander, “General Partial Differential Operators,’’ Acta Math., 94, 161-248, 1955. See 


especially pp. 216-217. 


A UNIFIED THEORY OF GENERAL RELATIVITY OF GRAVITATION 
AND ELECTROMAGNETISM. II 
By Luruer P. E1rsennart 
FINE HALL, PRINCETON UNIVERSITY 
Communicated June 28, 1956 
This is another proposal of a unified theory.!. The theory is based upon a sym- 


metric tensor g;;, a coefficient of connection I‘, and an electromagnetic vector 


¢:, in terms of which there is the skew-symmetric electromagnetic field tensor 


(1) 


Here, and throughout the paper, a comma followed by an index denotes the deriva- 

tive with respect to x with this index, and a semicolon followed by an index denotes 

the corresponding covariant derivative in terms of the Christoffel symbols in gj,;. 
From equation (1) it follows that 


F'ij,2 + F 5x. -+- Fi, = 0, (2) 
from which one has 
F ij + F 5x1 = Fi: a Q). (3) 


It being required that the determinant g of g,; be different from zero, and in fact 
negative, a contravariant symmetric tensor g” is defined by g’gy, = 6 where 
65'. is the Kronecker delta. 

For any coefficients of connection I';, one has 


iw = et + apr, (4) 


where a, is a tensor, as follows from the equation for the Christoffel symbols 
\‘mt and I*,, separately in two co-ordinate systems.? 
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For coefficients of connection I',, there is the generalized tensor® 
to) 


’ wt v vk vw sk . 
Mint I ‘LP I h,l + I il | I inl x1 
For I",, defined by equation (4), this is 
(6) 


Rint + Ayr -r — 


i k 2 k 
Qjn +1 + Gj; Akn — Ayn Axi, 
Contracting equation (6) for 7 and J, one has 


vw 
I" int 


where #',,, is the Riemann tensor. 
(7) 


i k k 
Rin + Qji' nn — Ap’. Ann — An Ani, 


LP jn - I ini ™ + a 
where F,, is the Ricci tensor. 
Thus far the treatment is the same as in the former paper.! The new proposals 


follow. 
We require that 
0, 


jh 
where I’, denotes the symmetric part of Ty, From (7) this is 


1 1 k 1 k i 
V/s» (ji cn H+ Gni 23 + G54 Aen + Ani Ax; ) 
(9) 


1 k k 
+ 2 (Qin si + An; - ) + : 2\Ajn + Onj Ay; 


R 5 3 = 


We take 
Gn = Fyd, (10) 
where for the present the vector \' is not specified. In terms of quantities of the 
type (10), equation (9) is 

Rin = 2/2 [Foster + Adela) + Fig Ae) ] + 1/2 (Fun + Fay). (11) 


We take 
New = Adela”. 


Equation (11) is now 
Ur (Fign + Fin). 


tin = 


From equation (12) one has 


(A, 2 = 2A, 


which is satisfied when we take 
NA, = 0. 


From equation (12) one has 
: kK Fi.) ky 
Ni;nt - Neen AA ( hk;t ~~ lk ;h AA hl;k 


by equation (3). By means of the equation‘ 


-— Ip 
Nini = Nien = NR sini 


equation (15) becomes 


NVR yini = A*Paaz- 
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Multiplying by g" and contracting for 7 and J, one has, on changing dummy indices 


— Ry = NWP ayy. 
From equation (13) one has 
NR jy L/W F inj. 
Adding these equations, one obtains 
NF ay = 0. 
This equation is satisfied, in consequence of equation (14), when we take 


AF a AXpA;, 


where a is a constant, 


From equation (12) it follows that 
Nj, = 0. 
By means of equation (17), equation (13) becomes 
Rs aN jAn, 
and in consequence of equation (14 
R q "R h 0. 
From equation (19) and equation (12) one has 


R?). a(r’r,) 37 
a(And?.; + Arg ;*) 


in consequence of equation (18). 
Thus R;,, given by equation (19) in terms of the vector \, defined by equation 
(12), satisfies the fundamental equation® 


R?y.5 — olen 
As another approach to the problem, we take 
Nis 2 Ad’ Fry 
in place of equations (12). If we put 
Nr 
we have from equations (22) 
a, h 2(a — 1)MF y, 
which is satisfied by a = 1, and accordingly we take 
N'A l. 
From equations (22) one has 
Ngat— Ae an = (Fak — Fuk in) + A" (P as i — Fy; n+ Pasi Fup 


which, in consequence of the identities (3) and the equation.‘ 
| | 
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Ni; ar ri; pn = NMR int, 


NR: = NMP Ps: a F ,,F i») “4. A APF: k + Pe: de 


When this equation is multiplied by g“ and contracted for 7 and J, one has 


—NRinp = MP uF + MPa ek + OP as i 
By means of equations (22), equation (11) is 
Rin = Fy at + ad (Pigs an + Fas 3). 
When this equation is multipled by \/ and contracted for 7, one has 
NRin = NF iP’ + /2'NF in; 

When this equation is added to equation (24), the result is 

NM Pare + Fis ¢ = 0. 
We take 

Fins k= AAO — And, (27) 


where @, is a covariant vector defined later. These equations satisfy the identities 
(3). 


By means of equations (27) and (23), equation (26) becomes 
(On — A»v‘O,) = F4; 
From equation (27) one has, in consequence of equation (23), 
Ft; ¢ = On — Ando. 
From these two equations it follows that 
és oe 0 
and 
A, — ApA‘O, 0. 
From equation (27) by means of equation (23) one has 
NF ins 5 = Aj(On — Andr'O,) = O. 
In consequence of this result, equation (25) becomes 
Ri = FyiFy', 
and one has 
g"R h 
From equation (30) one has 
Rn; 5 = Pi Pai + PY! PR ays 5 = PUP ag ; 


in consequence of equation (28). From equation (31) one has 
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Ryn = QF y3 PF". 
Accordingly the left-hand member of equation (21) is 
FHF}; P Nee Fyi5 a) = FUP yy; 4 
in consequence of equation (3). By means of equations (27) and (29), 


FF); ; = F?*(\,A0; — AjA Bn) 
= F),0, = F*,rA\'8, = 0. 


Hence equation (21) is satisfied, 
When equation (29) is written 


6,(6,,' — AWA) — QO, 


where 6, is the Kronecker delta, determination of the vector 6; in terms of the vec- 
tor, Az, is readily found, noting that in consequence of equation (23) the deter- 
minant of these equations is equal to zero. 

1 Cf. these PROCEEDINGS, 42, 249-251, 1956. 

2 L. P. Eisenhart, ““Non-Riemannian Geometry,’’ Am. Math. Soc. Pubs., No. 8, pp. 2, 3. 

8 Tbid., p. 5. 

*L. P. Eisenhart, Riemannian Geometry (Princeton, N.J.: Princeton University Press, 1926), 


p. 30. 
5 Thid., p. 82 


SINGULAR INTEGRAL EQUATIONS FOR DIFFERENTIAL 
FORMS ON RIEMANNIAN MANIFOLDS* 


By J. J. Koun 
DEPARTMENT OF MATHEMATICS, PRINCETON UNIVERSITY 
Communicated by N. E. Steenrod, June 11, 1956 


In the present note we obtain a necessary and sufficient condition for the regu- 
larizability of singular integral equations for differential forms on a compact 
Riemannian manifold. This result is applied, in a forthcoming paper by D. C. 
Spencer and the author, to a class of boundary-value problems for complex oper- 
ators. 

Notation.—Throughout this note the following notation will be used: M is a 
compact C“ Riemannian manifold of dimension m, 7, is the tangent space at 
x « M, the inner product and the norm on 7’, are denoted by wv and |u|, respec- 
tively, S is the tangent sphere bundle over M with fiber S, ¢ 7’, consisting of the 
unit vectors in 7',, A” is the bundle of complex-valued covariant skew-symmetric 
tensors of degree p, and r(x, y) is the geodesic distance between x and y in M. 
Finally, if NV isa C” manifold and B is a bundle over N, then '(B, N) denotes the 
C® sections of B over N. 

Consider the projection maps mm, m2: &K M — M; let A,’, A»? be the induced 
bundles over MX M defined by 7,~!A”, 12.~'A”, respectively.!_ We identify the 
restriction of A,” @ A»? to the diagonal of M XK M with A?@ A?. 
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Definition 1: An operator J on differential forms of degree p is called a singular 
operator if it can be expressed by 


oe gly), 


J ¢(x) E(x)eg(x) + J G(x, y)/ 
M 


where y « '(A”, M); &€T(A?@ A”, M); GeT(A,?@ Ay”, M X M — fdiagonal}); 
é(x)-¢(r) means the contraction of ¢ with the last p indices of £, that is, if & = 
ya a, ® B,, then é¢ b> (8,°¢)a,; the integral is taken in the sense of a principal 
k k 
value; and G satisfies the following conditions: 

a) For each ue T, — {0}, lim r(x, y) G(x, y) F.(u) (lim denotes the limit 


u le 
r—y ry 


as y approaches x along the direction of u); if ¢ > 0, then F,(cu) = F,(u). 

b) The map F: S— A? ® A? defined by F\S, = F,is aC” bundle map. 

c) J F, = 0. 

d) There exists h>0 such that if x, ye and if U is a co-ordinate neighborhood 
containing x and y then if we identify the co-ordinates on U with the co-ordinates 


they induce on 7, we have: 
r™—"r, y) G (ayy) — rea, y) Fy (ax y) «TT (Af @ Ab, U X U) 


where by x — y we mean subtraction with respect to the co-ordinates on UU’. 

We are concerned with the reduction of the singular integral equation J/g = a@ 
to a Fredholm equation. Let G? be the set of all singular operators on differential 
forms of degree p; then we have 

Definition 2: If J € G’, the equation J¢ a is regularizable if there exists an 
operator K ¢G? such that the equation K o.J¢ = Kaisa Fredholm equation which 
has the same solutions as the original equation. 

By applying? the composition theorems of Giraud,* we obtain 

THEOREM 1. G? is an algebra under operator composition. The subset S? of G?, 
consisting of those J ¢G? for which € = 0 and F = 0, ts a two-sided ideal of S?. 

Now for x « M and J €§? we construct a singular operator H,J on forms of degree 
p over T,. The important property of H,./ is that J is regularizable if and only if 
H,./ has an inverse for each x eM. H,J is defined by 

* F,(u, v) 
H.Joa(u) = &(x)so(u) + le * o(v), 
Jr; |lu — v 
where F,(u,v) €(A,? @ A”, T, X T; — | diagonal | ) and is defined by 
(F(t, 0)) (dh, «2, By, Os, «00; (F,(u — v))(a, — U, ...,@y — U, by — v,...,b,—v) 


where a;, b; « T,. We denote by 3,” the set of all operators on 7’, which can be 
obtained from the mappings /’, as above. Now let 5,” be the ideal of S? consisting 
of all operators with &(2) = O and F, = 0; then we have 

THEOREM 2. %, 7s an algebra under operator composition, and the map H,: SG? > 
KH,” ts a homomorphism of algebras with kernel S,?. 

By using the partition of unity, we obtain? the following theorem which shows 
that H, is onto. 

THEOREM 3. If F is a bundle map satisfying conditions b and c and if § « T(A? ® 
A”, M), then there exists J «SG? which corresponds to F and &. 
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Since the space 7’, has an inner product, we have an isomorphism 7', ~ 7',* 


® A,” and the space of linear 


which induces a natural isomorphism between <A ,? 

transformations A,” — A,’. This isomorphism provides A,” ® A,” with an 

algebraic structure. Thus §,’, the set of C” maps S, ~ A,’ @® A,?’, is an algebra. 

Note that S$,” can be represented as the algebra of matrices of C” functions on S,. 
Consider the map S,:3C,” > S,” given by 


(S,H,J)(u) = &(x) + 
T: 


Then a generalization of the classical properties of Fourier transforms gives 
THroreM 4. The map S,:%,” > §S,” ts an isomorphism of algebras. 
We now express S, in terms of spherical harmonics. If F, is given by the matrix 
(F’(u)), we S,, and if &(2) is given by (£”), then we obtain 


Turorem 5. If F’ = >> Y,," where Y,” is a spherical harmonic of degree n, then 
n l 
- 
(S_H,./) (uw) i+ > +, Y,%(u), 
n l 


whe re 
'(n/2) 
C{(m + n)/2] 

The analogue of the above theorem for singular integral equations on functions 
on the plane was first obtained by F. Tricomi.‘ 8. G. Michlin® has improved 
Tricomi’s proof and has made extensive use of the generalization of this result to n 
dimensions. ‘This generalization was first stated by Giraud.6 (See also Bochner,® 
Calderon, and Zygmund.’) 


To establish the above result, we apply? the Funk-Hecke’ theorem to 
(S,H,J)(u) = &@ + Sf R(wv) F,(v), 
Sz 
where 
— log » for wv > 0, 
R(u-v) 


| — log p for wv < 0, 


and we obtain 


1 
JS Ms) Cc, lir)(1 — 2’ 
eee 


where C,” is the Gegenbauer polynomial® defined by 
(l—22t— ??)’ = « S Cat" 
n=0 


and w,,-» is the volume of the unit sphere in Euclidean (m — 1)-space. 





Vou. 42, 1956 MATHEMATICS: J. J. KOHN 653 


Now we define the homomorphism Sy:S? ~ Sy’, where Sy” is the algebra of 
bundle maps of S— A? @ A”. Sy, is defined by 


SJ (u) (S,H,.J)(u), where ue S, CS. 


Then 

THreoreM 6. The equation J¢ a can be regularized if and only if SaJ has an 
inverse in Say”. 

In the above-mentioned paper by D. C. Spencer and the author the following 


operators are introduced: 
d,g = '/2(r VY —1)d¢e + '/(7 + VV —l1)d¢ 


where ¢ is a differential form on a finite almost-complex Kahler manifold V and + 
is a point on the Riemann sphere. We set d. = d and we have d,=j = 0. Now 
the Poincairé-Fredholm method for the solution of the Neumann problem for d, 
with the boundary conditions ng and nd,g prescribed on bV, the boundary of V, 
leads to the introduction for xeV of the integral 


A(x) Sus [d,go(r,y) A *aly go(x,y) A *B(y) | 


where go is a fundamental singularity (see Spencer") for the operator 1, in some 
domain containing the closure of V. We evaluate the limits of nA (ax) and nd,A (xr) 
as x approaches the boundary. Thus we arrive at a system of integral equations 


By calculating the high-order terms in the kernels of J, we find that J, is a singular 
operator for 7 # © and an ordinary integral operator for 7 o,. Applying 
theorems 5 and 6 we obtain: 

THeoremM 7. The operator Jr is regularizable if and only if reZ, where Z 
1 7| Re(r) 0 and Im(r) <1}. 

An example of K. Kodaira shows that for reZ an eigen value of J7 can have 
infinite multiplicity. 


* This note pertains to work done while the author was a National Science Foundation fellow. 
‘'N. E. Steenrod, 7'opology of Fibre Bundles (Princeton, N.J.: Princeton University Press, 
1951) 
2J. J. Kohn, “‘A Non-self-adjoint Boundary Value Problem on Pseudo-Kahler Manifolds’’ 
(thesis, Princeton University, 1956 
G. Giraud, “Equations a intégrales principales,’ Ann. Ecole norm. supér., ser. UI, 51, 231-372, 
1934, 
‘ F, Tricomi, ‘‘Equazioni integrali contenti il valor principale di un integrale doppio,”’ Math. Z.., 
27, 87-133, 1928. 
5§. G. Michlin, Singular Integral Equations (Am. Math. Soc. Translation No. 24 [New York, 
1950] ). 
6 G. Giraud, ‘‘Sur une classe générale d’équations a intégrales principales,’’ Compt. rend. Acad. 
sci. (Paris), 202, 2124-2127, 1936. 
7H. Bateman, Higher Transcendental Functions (New York: McGraw-Hill Book Co., 1953). 
*S. Bochner, Theta Relations with Spherical Harmonics, these PRocEEDINGS, 37, 804-808, 1951. 
» A. P. Calderon and A. Zygmund, “On the Existence of Certain Singular Integrals,’’ Acta 
Vathematica, 88, 85-139, 1952, and ‘On Singular Integrals,’’ Am. J. Math., 78, 289-320, 1956. 
© T). C. Spencer, ‘Potential Theory and Almost-Complex Manifolds,’’ Lectures on Functions of a 


Complex Variable, Ann Arbor: Univ. of Michigan Press, pp. 15-43, 1955. 





MEASURES OF THE VALUE OF INFORMATION 
By Joun McCarruy 
DARTMOUTH COLLEGE, HANOVER, NEW HAMPSHIRE 


Communicated by Claude Shannon, July 12, 1956 


1. Introduction.—Our knowledge of a future event may take the form of a set 
of probabilities p; Pn. For example, we might have probabilities of */s, '/s, 
and '/» for rain, snow, and clear as tomorrow’s weather. In communication theory 
our interest is in the various events only as carriers of a coded message. For this 


purpose Shannon’s! entropy — >>p; log p; is the appropriate measure of our uncer- 


tainty, and a function A >>p; log p; + Bis a good measure of what it is worth to be 
given these probabilities. In our weather example we care which event occurs. 
Furthermore, we may be more interested in whether the sky is clear than in whether 
rain or snow occurs if the weather is bad. In this paper we show that any convex 
function of a set of probabilities may serve as a measure of the value of information 
and that two such functions are equivalent in an appropriate sense if and only if 
they differ by a linear function. 

2. The Forecaster and His Client.—We get our quantitative measures of the 
value of information from a situation in which a client pays a forecaster for pre- 
dictions of a future event according to the following rules: 

(i) The forecaster gives the client probabilities q, gn for the events, where 
ea = 1. 

(ii) The client takes action on the basis of these probabilities, and one of the 
possible events occurs. 

(iii) If the 7th event occurs, the client pays the forecaster fi(q, . ~~ , Gn), Which 
is abbreviated f;(q). 

(iv) We assume that neither the forecaster nor the client can influence the pre- 
dicted event, although the forecaster can make experiments to help predict it, and 
the client gets an amount which depends on both the action he takes and on the 
event which occurs. In what follows, it is assumed that the forecaster and the 
client both wish to maximize the expected value of their incomes. 

Assuming that to the forecaster the probabilities of the possible events are p, 

Pn, his expectation is >>p,f;(q) if he tells the client the q’s. A payoff rule is said to 
“keep the forecaster honest” if, regardless of the value of p = (p, Dn), the 
forecaster’s expectation is maximized if and only if he puts q = p, ie., gq; = p, for 
each 7. 

THEOREM |. A payoff rule keeps the forecaster honest if and only if f;:(q) 
(0/0q,)f(q), where f(q) is a convex function of q which is homogeneous of the first degree. 
The expectation of an honest forecaster is then >> pf i(p) = f(p). 

We omit the proof. The derivative has to be taken in a suitable generalized 
sense. f(q) is called a “payoff function” if it satisfies the conditions of Theorem 1. 

I. J. Good? considered the problem of paying the forecaster with the restriction 
that f(g) = F(q,), i.e., the payoff depends only on the probability assigned to the 
event which actually occurred. He showed that putting F(x) = A log x + B keeps 
the forecaster honest, and Gleason (unpublished) showed that this is the only F(z) 
which does. The forecaster’s expectation is then A op; log p; + B, ice., he is paid 
a fixed fee minus the expected uncertainty about the event after his prediction. 
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3. The Client’s Expectation.—Suppose that on the basis of the forecaster’s pre- 


diction the client chooses the jth of the actions open to him and that his payoff if the 
ith event occurs is a;;. His expectation will be g(p) max >> a,jp, if j is chosen 


optimally. 
From the theory of convex functions we have 
THEOREM 2. Any function g(p) defined for py = Pn = 0 which is convex 


and homogeneous of the first degree can be written in the form max >> ayp;. Unless 
} 


g(p) is piecewise linear, there will have to be an infinite number of actions j. 

If we put f(p) = g(p), the client is eliminated from the picture, since under this 
condition he turns all his gains over to the forecaster and is reimbursed for all his 
losses. This is not a satisfactory solution to the problem, so let us see what payoffs 
f are equivalent in their effect on the forecaster’s efforts to get information. 

1. The Forecaster’s Experiments.—Assume that the forecaster has a_ priori 
probabilities 7; r, for the events, that he has a choice of m experimental pro- 
cedures with expected costs to him of ¢, Cm, and that the conditional proba- 
bility of the kth outcome of the hth experiment given that the 7th event will occur 
iS S:,;. The experiment chosen by the forecaster will depend on the c’s, the s’s, 
and the r’s and on the payoff function chosen by the client. We call two payoff 
rules equivalent if, for any set of c’s, s’s, and r’s, they lead to the same choice of 
experiment by the forecaster. 

TueoreM 3. f(q) and f*(q) are equivalent if and only if f(q) = f*(q) + dca, 
1.e., f the two payoff functions differ by a linear function of the q’s. 

The proof is omitted. If f and f* are equivalent, then f;(q) = f:*(q) + aj, so that 
the payoff rules differ by an amount which depends only on the event which occurs 
and not on the forecaster’s prediction. The forecaster’s and client’s interests will 
be identical if we put f(g) = g(q) + doa: The a;’s are subject to negotiation be- 
tween the client and the forecaster, and they determine both a base level of pay- 
ment and also a betting relation between the client and forecaster. If f is normal- 
ized so that f(1,0,...,0) = f(0, 1 0) = ..., the payment for a precise cor- 
rect prediction is independent of the event predicted. 

5. Conclusion.—The foregoing analysis shows that any convex function of a set 
of probabilities will, under appropriate circumstances, be a measure of the value of 
the information contained in a set of probabilities in the sense that it is an appro- 
priate payment to a forecaster who furnishes the probabilities. 

The intuitive content of the convexity restriction is that it is always a good ides 
to look at the outcome of an experiment if it is free. For suppose that the experi- 
ment has two outcomes, A and A*, which would give one probabilities p and p* for 
the event in question. Let ¢ be the probability that A is the outcome. If we de- 
cide not to look, our expectation is f(¢p + (1 — t)p*), while if we decide to look, our 
expectation is tf(p) + (1 — bf(p*). 

Finally, we remark that there are yet more general ways of paying the forecaster. 
For example, the client may agree to pay a certain fraction a of the costs of ex- 
perimentation. Then the payoff function can be scaled down by a factor a with 
the identity of interests still preserved. We hope to treat these matters on an- 
other occasion. 

1 C, E. Shannon and W. Weaver, The Mathematical Theory of Communication (Urbana: Uni- 


versity of Illinois Press, 1949). 
2 T. J. Good, “Rational Decisions,’’ J. Roy. Stat. Soc., B, Vol. 14, No. 1, 1952. 





DIOPHANTINE EQUATIONS IN CERTAIN RINGS 
By H. 8S. VANDIVER* 
UNIVERSITY OF TEXAS 


Communicated July 12, 1956 


Let p be an odd prime and Fp” denote a finite field of order p”, and consider 
the solutions of the conditional polynomial equation 


(1) 
where the coeff cients are in F[p"] and s > 1 if the equation contains a non-zero 


constant term; s > 2, if such is not the case. An equation which has been ex- 
tensively studied, particularly in recent years, is the special case 


Cty + cote" +... + ,2," + ce = 0, (2) 


where the a’s are integers such that 0 < a < p” — 1;8s > 1 fore,,; # Oand s > 2 
forc,.; = 0, the c’s being given elements of F'[p”"] and 


Oa: xine ee, (2a) 


In the present paper we consider the application of the theory of the equation (2) 
to the derivation of conditional equations with coefficients in R which have no 
solutions in R. Here R denotes a commutative ring with a unity element. When 
R is the rational ring, it was shown in several other papers ? how to derive certain 
classes of equations of this type, and in the present paper we shall use extensions of 
these methods. 

In the case of the applications to the rational ring, we first employed the con- 
gruence 


Cay + cote? +... + ets + Cy =O (mod p), (3) 
with the c’s rational integers, ¢c; . . . c, # 0 (mod p); s > 1 for ¢,,; 4 0 (mod p) and 
s > 2 for c.,, # 0 (mod p). Now we let R be a commutative ring containing a 
unity element and, analogous to equation (2), consider the congruence, in the 


unknowns @), we w,, each € R, 


oir! + ome” +... + os" + On FO (mod p), (4) 


where p is a prime ideal in # of definite norm p", p an odd prime, the norm of p is 


the number of distinct elements in the residue class ring of p, and 
0102... OpW1W2... 0, # O (mod )). (4a) 


Also, s > 2 if o,,; = 0, and s 2 if o.4, ~ 0. The ideal p being a prime ideal in 
R, the residue classes modulo p form a finite field F'[p”], since a finite integral domain 
is a field. Suppose that it is possible to show that congruence (4) has no solutions 
which satisfy condition (4a). Assume, further, that it has been determined that 
(4) has no solutions even when we allow any of the x’s to be =0 (mod p), aside from 
the trivial case where every x has this property, with o,,; = 0 (mod p). Now 
consider the Diophantine equation in R, 
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(0) 


where 8 0 (mod p), 2 By eee ._y £0(mody). Modulo py, this reduces 
first to 


VOss1 + p: YO ipi 0 (mod py), (5a) 


a 
i=l 


and since y # 0 (mod p), we obtain relation (4), which we are assuming has no 
solutions; hence equation (5) has no solutions in the y’s ¢ 2. 

In what follows, we shall show that this principle may be applied to many classes 
of Diophantine equations of type (5) and also related types. 

If the laws of addition and multiplication are defined in R, then, if we except the 
zero residue class modulo p, the other residue classes form a cyclic group of order 
p” — 1 under multiplication generated by an element @ « 2, and 6 may be determined 
in a finite number of steps. 

An example of the type of ring we have called F in the above is the ring of integers 
in an algebraic field, since in this ring every ideal has a finite norm. 

As the residue classes modulo p in congruence (4) form, as we have noted, a finite 
field, we may replace congruence (4) by an equation in a finite field such as (2) and 
study the number of solutions of (2) to obtain the number of solutions of congru- 
ence (4). Assuming that the 2’s in equation (2) are nonzero in F'[p"], in another 
paper* we employed g, a generator of the cyclic group formed by the nonzero ele- 


ments in the finite field F'[p”"]; if c..1 # 0, we replaced equation (2) by 
l + E(e, T QQ) a EK(e, tT @;QM;) 0, (6) 
where E(t) = g’, and if ¢,,; = 0, equation (2) was replaced by 


E(e, + aya,;) +... + Ele, + a,a,) = 0, (6a) 


provided that in equation (2) x; ¥ 0, 7 1,2,..., 8. Let (p* — l, a,) = d,, 
i= 1,2,...,8. Now the a; in equations (6) and (6a) may be taken in the set 


The number of solutions of equation (2) for nonzero 2’s is equal to the number 
of sets of values ay, a2 a, in equations (6) and (6a). Formulas were employed 
which showed that this number of solutions could be determined® (p. 50) directly 


if the number of sets of the solutions aj, a: in F[p”| of 
E(u, + aym) + E(u, + am) + 1 0 


was known for any integers uw; and uw = 0, where m is the L.C.M. of dh, da, .. . ds. 

As m is the L.C.M. of the a’s in congruence (4), the simplest case to consider 
appears to be the case when the a’s are all equal. In an earlier paper,‘ referred to 
above, this case was considered, and also the case when RF is a rational ring and the 
congruences have no constant term. The results were extended by Ankeny.° 
The methods employed in these two papers do not, however, apply to Diophantine 





ea icy lla oF 
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equations with non-zero constant term, whereas the methods we are describing 
here do apply, to some extent, to this case. 

[t is possible to obtain non-solvable sets of equations of type (7) with little of the 
element of trial involved. If we consider the equation 


aya” + Bor” + y3; = 0, 
with a, 82, and y; each e R and the corresponding congruence 
L + a2," = Biz” (mod p), (9) 
then it is known that if we use the last congruence in the form, equivalent to (7), 
L+ ot? ager (mod p), (10) 


with 6 defined as before, and if [a, b],, denotes the number of solutions (7, s) 
in equation (8), then, if p" = 1 + dm and if d < m, we may use the known relation 


m~ 


1 
>: [a,b], =d or d —1, (11) 
g=() 


‘ 


so that at least one of the [a, b]’s must be zero for each b. 

To examine the above possibility, we may set up a table giving the values of the 
(a, b]’s in (10) for each a and 6; then, to obtain the number of solutions of con- 
gruence (4) with each w # 0 (mod p), we employ the formulas® which give the 
number of solutions of (4) under conditions (4a). (This method had already been 
described in the second reference in our earlier paper! for the case of R, the ring 
of rational integers.) In the case in which the number of solutions turn out to be 
zero, we may have an application to Diophantine equations in R, if we find, upon 
examining further the number of solutions of (4), where some of the w’s may be = 0 
(mod p),iszero. Thisrequires the consideration of a set of congruences obtained from 
(4) by substituting zero for each w in any combination of w’s and determining the 
number of solutions, with no w congruent to 0 (mod p), of each of the resulting con- 
gruences in the various w’s remaining. If it turns out that none of these con- 
gruences have any solutions in R, except for the trivial case o,,; = 0 (mod p), 
and all the w’s are zero, then it follows that none of equations (5) have any solu- 
tions. As we see from the proof of Theorem III of a previous paper,* when we were 
considering the equation in the ring of rational integers, and also relation (11) of our 
present paper, we are not likely to be able to set up many types of equations without 
solutions of type (5), with each a = m, unless in the relation p” — 1 = dm, m is quite 
large with respect to d. 

Elsewhere ' ? the congruence 


1 + ax’ = by’ (mod p) (lla) 


was treated in the case in which it is conditional in x and y, p is an odd prime, 
ab # 0 (mod p), and J and ¢ are integers each greater than 1 such that le = p — 1. 
By setting up tables! ? of the possible solutions of these congruences for a given 
p, it was shown how to employ these to derive nonsolvable equations of the type 


1+ ary = by;'. (11b) 


In a similar way we may use the congruence 
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L + Oyu = dope! (mod p), 


where 6;, 62, 41, w2 are elements in FR, and p, as before, is a prime ideal in R such that 
N(p) = p" 1 + el, with 6 defined in R as before and assuming 


b,Oouiuo # O (mod yp). (13) 
We may replace (12) by 
= pits (mod p), (14) 


with 7 in the set 0, 1 — 1; sin the set 0, 1 ce — l,i and 7 fixed with 
0<5:1sc¢c¢-—-1;08j7 Sl— 1. The equivalence of the two congruences (12) 
and (14) follows easily from the properties of the 6’s. We shall denote the number 
of solutions r, s in the last congruence within the limits mentioned [7, 7]... Since 
the residue classes in R modulo p form a finite field, we may use the following known 
relations: If lis odd, 


and if l is even, 


and also 
+. .7# 6. a 
a ie ti J (17) 
a il ff 73 =0. 
Also, in testing (14), suppose that we find, for a particular 7, 7, that [7,7] = 0. To 
apply this result in order to obtain Diophantine equations with no solutions, we 
have to test the possibility that (12) is solvable for either 4; = 0 (mod p) or pw: = 
0 (mod p), assuming 


3, = 6! and 5. = (mod p) 


in (12). For this we have the following criterion: 

If (ui, p) = Pp, (ue, p) = 1, then (12), where p” — 1 = cl,i = 0, 1 > 
andj = 0, 1 — 1, has solutions if and only if 7 = 0. If (uw, p) = p, and 
(u1, p) = 1, then (12),2 = 0, 1 c—l,and7 = 0, 1 1 — 1, has solutions 
when / is odd if and only if 7 = ¢/2; when 1 is even, if and only if 7 = 0. 


Based on the tables of the [7, j|’s calculated," ? about 25 per cent of the con- 


gruences (11a) for a given p and / led to impossible equations of type (110). 
In order to apply certain formulas which have been published elsewhere, we 


employ (14) in the form 
gre: 4 grin = | (mod p). (18) 


If (2, j1)-: denotes the number of solutions s;, ¢; in the above, then, obviously, 
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(i + €& Jer = [tJ ]er, (18a) 
where « = (p” — 1)/2. 
We shall now examine the question as to the number of (7, 7)’s which are zero 


for a given p,c, andl. To effect this, we shall employ the formula‘ if b # 0 (mod 


c), with le + 1 = p",t 


cont 
2. (d,j)(d,j —b. + 1 (19) 


7=0 


where in the summation on the right-hand side of this equation d ranges over the 
multiples of J in the set 0, 1 1. Let l be odd; then, in view of (15) and 
(18a) we see that for 7 = 0 we have 


I—1 
> (0, der — 1. (19a) 


— 

i=0 
Consequently, at least one of the summands on the left equals zero. For the cases 
where b # 0 (mod c), we may use formula (19), and if we assume that none of the 
(7, j)’s under this assumption is zero, it follows that each must equal unity; hence 


(19) gives 
(d, h),-(d, h oe b). 0 


for each h in the set 0, 1 *— 1, and dis any multiple of J in the set 0, 1 
v — 1;b any integer in the set 1,2,...,¢ — 1. 
In equation (20) for a given d there is just one (d, h’),. which equals unity; hence 


(20) gives 
(d, h’ — b), 0) (21) 


Then, if we assume d # O (mod c) and employ (17), 
we have 


? (d, hy — b), l — (d, hy’). 
az -l-1 


which contradicts (21). The above argument was made under the restriction 
that / was odd. If it is even, the same proof goes through with slight modification. 
Hence we conclude that there are always two of the (7, 7),:’s that are equal to zero, 
and consequently also two of the [z, j|’s. However, induction indicates in the 
case of n = 1 that about one-third of the [2, 7]’s are zero.” 

Elsewhere! ? we have mentioned the problem of determining the primes p in 

(14) with p = le + 1 such that [7, 7] = l for some 7 and 7 with / fixed, which we 
called “Case A” of (14). Also, we noted the existence of seven primes p with / 
5 such that [7,7] = 5. But we noted that for 1 = 7 there was no prime p < 2003 
such that Case A held for any particular? andj. Jt may happen, of course, that since 
c increases with p, for p sufficiently large we shall always find an |i, 7] l. On the 
other hand, we know that,’ if in Case A, [b, h]., = l for some h with a fixed b ¥ 0 
(mod c), l odd, 
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for d # 0 (mod /), with d in the range 0, | ; p — 1, @ any integer in 
the set 0, 1 *— 1. So the number of these criteria also increases with p. For 
the case when 1 = 3, Case A occurs when p 103, 109, 127, 139, 163, and it occurs 
more than once for each of these primes. However, the case 1 = 3 may be excep- 
tional, at least as far as numerous occurrences of Case A are concerned. Often, 
in dealing with trinomials, if we are using some prime parameter in our investiga- 
tion of them, the prime 3 is exceptional. 

In other papers the writer has used the term “Case B” to indicate the case when, 
for 7 fixed, each one of the (7, j).:’s is unity. This situation has already arisen 
in the present paper in connection with (18), and in view of application of (19) 
when each (5, 7) | (and therefore for each [b, 7]), we may then state: 

If bis fixed and #0 (mod ce), then, if Case B holds, relations (20) follow. 

There is a connection between Cases A and B of the [{7, j]..’s, which we shall now 


explain. Consider the cyclic group formed by the 6’s in (18). If ¢ is prime to J, 


it may be represented as the product of two cyclic groups of orders ¢ and 1, gener- 


ated by 6’ and 6. In view of this, we may replace equation (14) by 
1 + 679°! = pro" (mod p), (22) 


with 2, al (mod ¢), j; = be (mod J), where ¢ is in the range 0, 1 1, and 


where u is in the range 0, | >— 1. Clearly [2,7]. lal, be|.;. Suppose that 


lal, be|., = d in congruence (22); in view of this we are led to the congruences 
1+ gee = 9p (mod p), (23) 
where i ,2,...,d, and no two of the ?’s are congruent, modulo/l. Division gives 
1 + 9779 = 90 —@<gilue—a) (mod p). (23a) 


Now in the exponent (b — ¢,)e on the right, since the ¢’s are incongruent modulo J, 
then so are the (b — ¢,)’s.. We see from this that the value [—al, be],.,; may be, in 
general, quite different from [al, be]. In particular if d = k = 1, then the (b — ¢,)’s 
are congruent in some order, modulo /, to 0, 1 , so that we have: 

If Case A holds for a given i in equation (14), then Case B holds for (—7) in lieu of 
2, if cis prime tol. 

The converse of this statement does not hold. The argument just given, how- 
ever, also shows that Case A never arises for 7 = 0 in equation (14). 

For c = 0 (mod J), we note that (14) gives 


bE Or as F tgs (mod yp), 


and it is easy to see then that the result stated above holds in the case ¢ = 0 (mod 1) 
also. The situation when c = 0 (mod /,), 1 < l, < ¢, is more complicated. 

We have already referred in this paper to the application of a nonsolvable con- 
gruence of type (lla) to deriving nonsolvable equations of type (11b). In the 
case in which / = 2, these ideas may be put in a very simple form. We shall con- 


sider the congruence 
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— | 
a+ be“ =ky? (mod p); a, (24) 


If xy # 0 (mod p), abk # 0 (mod p), and, using Legendre symbols, 


(“ - a (f + ) 
~ I, ¥~ 1, 
Pp p 


then (24) has no solutions, under the conditions mentioned, as we see by using 
x“ = +1 (mod p). If we now consider the possibility zy = 0 (mod p), this is ex- 


ak 
( ) x 2 (26) 
p 


at+b#0 (mod p). (27) 


cluded if we assume 


and 


Consequently, under conditions (25), (26), and (27), congruence (24) has no solu- 
tions whatever in integers. Hence the equation 
a+ bx," = ky? (28) 

has no solutions in integers.® 

Assuming p, l, and ¢ fixed in congruence (24), we may take a, b, and c each in the 
set 0, 1 p — 1, and classify all the impossible congruences derivable by this 
method, in a finite number of steps. This is another form of the process we used of 
setting up tables for a given p of the impossible congruences of the type (24), ex- 
cept that, in the second process, primitive roots of p were employed; but this is not 
necessary in the first method, in which we may employ the law of quadratic rec- 
iprocity conveniently. 


We will now consider congruence (24) again, but from a different point of view. 


We can regard a, b, and k as fixed and let p vary. We can then note that there is 
an infinity of primes p fitting conditions (25) and (26), with a, b, and k fixed and 
provided that none of the quantities 
(a — b)k, (a + b)k, ak (29) 
is the square of an integer and also that the product of no two of these quantities 
has this property, as well as the product of all three.? As far as I know, the known 
proofs of this result are not constructive, and we do not know of any way of deter- 
mining such primes in general; but for some small given values of a, b, and k we 
may determine some primes of the type desired, using the law of quadratic re- 
ciprocity. 
The ideas just explained may be extended to the examination of the congruence 
with (a, b) = 1, a and b integers, 
a+ bat =y/' (mod p), (30) 
with p an odd prime, likewise /, and the equation 


a+ ba® = y', (31) 


a, b, and l given, ab # 0 (mod p). 
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Set, if N(p) is the norm of p, 


w \ 
( ) é * (mod p), 
p/, 


where w is an integer « K(¢), an algebraic field defined by ¢ = ¢ 
in K prime to (wl) and which divides (p). By a theorem given by Takagi," there 
exists an infinity of prime ideals p of the first degree « K such that, if (a’ + b’), 


2ix/l 


,pisa prime ideal 


and (pl) are prime to each other, 


a + b¢' 


(“*) = or (= pan = { osm (32) 
D/1 p 


where the a’s are prescribed integers in the set 0, 1 1 — 1, and m is some in- 
teger # 0 (mod J), provided that, if the relation 


then 


mo my am”! i ton y', (33) 


a ay o +s Cie) 


with y an integer in K, and the m’s integers, then m; = 0 (mod 1), 7 = 0, 1,... 
holds,! — 1. Wethen select py so that ab # 0 (mod py). 

We now consider congruence (30) and assume in equation (32) that none of the 
a,’s,t = 0,1,...2— 1, isdivisible by l. Let @ be the generator of the cyclic group 
of the residue classes (excluding the zero class) modulo p under multiplication. 
Then we may select 6 so that 6° = ¢ (mod p). Hence we may obtain from con- 
gruence (30) 

a + be = ;/' (mod p), (34) 


(mod p), 


(“ + ©) ' 
p l 


for some h, if congruence (34) is satisfied. However, if we select, as above, the 


whence 


a’s to satisfy equations (32) with none of the a’s = 0 (mod J), this shows that con- 
gruence (30) is impossible for zy # 0 (mod 1). As to the last restriction, this can 


be removed if we assume that 


a 
( ) 1: a' + b' #0 (mod p), (35) 
p/i 


which is always possible if a is not the /th power of an integer; consequently equa- 
tion (31) has no solutions, provided that we can show that m;= 0 (mod J), 7 = 0, 1, 
...,— 1, in equation (33) and also that conditions (35) are satisfied. As to the 
first condition, we note that a + bg‘ is prime to a + b¢’, i # 7 (mod J), as, if they 
have a common prime ideal factor q, then, if a; is an ideal in K 


b(¢ — &) =0 (mod q), 
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or q divides (1), contradicting (a' + b', 1) = 1, noting also that b # 0 (mod p) is 


> 


impossible, since (a, 6) = 1. Hence equation (33) gives 
(a + be)” a;, 


by the unique factorization of ideals. 
If m; # 0 (mod 1), set mim,’ 1 + kl; then 


F sf\ mim’ ; +kl 
(a + be’) : =(a+ be’)! a; 
(a + be") ay, 


(a + be") b,!. 


If ¢ 0 (mod 2) in equation (36), we may replace ¢ by ¢* for s 


and multiply together the resulting equations, to obtain, if a # v, 


a' + b! 
: a’, 
a+b 


Hence mj, me, ... m,—; are all zero unless equation (37) holds. If my ¥ 0, then we 
have 
a +- b = t!. 38) 
It then follows that unless one of the two equations (37) or (38) is satisfied and 
every m in (33) is zero, we may select the a’s in equation (32) to be all prime to l. 
We then have the following theorem. 
THEOREM. The equation 


da + kyp + (db + kep)x® = (d + kap)y', (39) 


with (a, b) 1, a, b, and l fixed, is impossible for an infinity of integers ¢ such that 
cl + 1 p, with p an odd prime provided that lis an odd prime, 


x rl a+b <x s!: a x ft. 


a+b 


and a, b, r, s, t are intege rs, d # 0 (mod Pp). 


* Part of the work on the present paper was done under National Science Foundation Grant 


G1397. 
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A FUNDAMENTAL THEORY OF SUPERCONDUCTIVITY 
By KENNETH S. PITzER 
DEPARTMENT OF CHEMISTRY AND CHEMICAL ENGINEERING, UNIVERSITY OF CALIFORNIA, 
BERKELEY, CALIFORNIA 


Communicated June 6, 1956 


Although phenomenological theories of superconductivity are now well developed, 
there is still need for an adequate fundamental theory based directly on the quan- 
tum behavior of the electrons and the lattice. Recently Schafroth' has shown 
that a Bose gas of charged particles would show the Meissner effect. Also, Bar- 
deen? has shown that a separate particle system where all excited electronic levels 
are separated from the “surface of the Fermi sea” by a finite energy will show the 


Meissner effect. 


Our analysis of the problem will be more descriptive than mathematical and will 


be similar in many respects to the theory of liquid helium 1.’ 
The key feature is the finite binding energy of electron pairs or, more precisely, 
the finite energy required to excite the electronic system in modes other than 


translational flow of the entire system. 
I 


Our general knowledge of chemical substances leads us to expect electron pairing. 
In molecules chemical bonding is strongest for completely paired electrons in all 
but the rarest cases (such as O.). We normally find unpaired electrons associated 
only with atomic inner shells, which are not involved in bonding such as in the rare 
earths. Metals at room temperature are indeed an exception to this general situa- 
tion (although the fraction of unpaired electrons is infinitesimal). Simple theories 
of metals which are successful in many other respects fail to predict supercon- 
ductivity; however, these theories fail to take adequate account of coulombic re- 
pulsion of electrons and make the assumption of a perfect lattice of stationary nu- 
clei. 

It is known‘ that lattice distortions tend to produce breaks in the energy-level 
bands. If a single distortion lowers the electronic energy sufficiently (more than 
the energy of zero-point vibration in the corresponding mode), then permanent 
distortion occurs. Even for a lattice which is perfect in the mean location of the 
atoms, however, the total energy must be averaged over the various distortions 
which arise from zero-point vibration. Thus, while the electronic state with com- 
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plete pairing of spins may be stable by only an infinitesimal margin for a perfect 
lattice, this margin may be expected to become finite when averaged over the distor- 
tions of the lattice. 

Let us also discuss this question in terms of Pauling’s’ theory of metallic binding. 
One notes that there are a number of locations in the lattice that are appropriate 
for chemical bonds, i.e., the shortest distances occurring between atoms. In a 
metal the number of these bond sites exceeds the number of bonding electron pairs. 
This is in contrast to a crystal such as diamond where the number of sites is pre- 
cisely equal to the number of bonding electron pairs. 

In addition to the bond sites, one must consider the available atomic orbitals 
suitable for bonding electrons. While almost always a pair of suitable orbitals can 
be found for each bond site, there are limitations on the number and arrangement 
of orbitals which may be occupied simultaneously. Thus in graphite, while the 
2p orbital perpendicular to the plane is appropriate for the formation of a m bond in 
any one of the three bond sites adjacent to the atom, only one of these bonds can 
be formed at a given time. This limitation is important and, while less restrictive 
in true metals, must be kept in mind. It will be more restrictive in metals where 
the ratio of electrons to valence-shell orbitals is high. Many such metals show 
superconductivity, while those with a low ratio of electrons to orbitals do not. 

Pauling now constructs wave functions in which the electrons occupy these bond 
sites and bond orbitals either in pairs or singly. Coulombic repulsion between 
electrons is very strong; consequently the set of sites occupied in a low-energy 
wave function must maintain uniform charge density. The true wave function is 
taken as the optimum linear combination of these component functions (in the 
manner of quantum-mechanical resonance). The lowest energy is obtained if the 
electrons, which occupy orbitals singly in one component function, have spins 
paired, because the resonance may then include other functions with these elec- 
trons in a single bond orbita'. 

Such a system of electrons is free to move collectively through the lattice and 
thus conduct electricity. Symbolically one may write a possible wave function 


for a flowing metallic electronic system: 


VW = exp tk-(>-R)), (1) 


where ® is the wave function of the system in the absence of flow and the R, vectors 
locate the electrons. Feynman* discusses this type of function more fully in 
connection with He mu. The question of resistance, i.e., energy loss, from such a 
flow will be considered later. 

Now let us recognize that the lattice is vibrating with zero-point amplitude 
supplemented by any lattice thermal excitation. As the lattice oscillates, some 
bond sites will become of more nearly ideal length than others. Also, the angles 
change, and it will be more favorable in terms of atomic orbitals to occupy certain 
sets of bond sites than others. The electrons will tend at all times to occupy the 
more favorable bond sites for a large fraction of the time. 

We now consider the change when one electron spin is reoriented so that there 
are two more spins of one sign than of the other. These two electrons now require 
separate orbitals at all times, whereas before they could occupy a single orbital. 
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ven in a perfect, stationary lattice it seems likely that this change may some- 
times cause the use of a less favorable orbital and a finite increase in energy. But 
in a distorted lattice the new state clearly requires the use of a less favorable orbi- 
tal, because, in order to maintain uniform charge distribution, the additional orbital 
must be selected within a very limited region of space. Thus we conclude that un- 
pairing two electron spins will cause a finite increase in energy in a vibrating lattice. 

The writer does not claim to have proved rigorously this essential point, but he 
does believe that the best available theoretical arguments strongly indicate the 
correctness of this finite excitation energy. Also, he is aware of no disproof. 

While we shall not discuss the transition to the normal state in any detail, it 
seems safe to assume that the transition temperature is of the order of magnitude 
e/k, where ¢ is the energy required to unpair the first spin. We shall give some 
explicit support for this relationship later. For a rigid perfect lattice we believe 
that this energy may still have a nonzero value e., in at least some substances. 
In all cases, however, we expect the electronic energy to rise quadratically with 
lattice distortion and to rise slightly more steeply when a spin is unpaired. Thus 
we expect a term ¢(/) proportional to the square of the amplitude of the lattice 
vibrations or to M if .W is the atomic mass. 

Thus we have 


eT See € - «(M) e. + (const.) M : (2) 


If «., is negligible, one expects 7’, to vary with ./~ *, as was observed for mercury.® 
In the case of tin’ the reported exponent of 7 is —0.462 + 0.014, which deviates 
from '/, by about three times the stated error. This offers some indication that 
e.. may be significant for tin. 

Thus the isotope effect is satisfactorily explained in this theory. It is also in- 
teresting to note that these energy terms are very small in comparison with the 
total zero-point energy of lattice vibration. Consequently no large difference in 
lattice constant, elastic constants, or lattice specific heat is expected between normal 
and superconductive states. 


Next let us consider the possibilities of low-energy thermal excitation of the 
completely paired electronic system. Pines and Bohm* have shown that density 
oscillations of the usual phonon type have a very high energy in fluid of electrically 
charged particles and that they will not be excited in the electronic system of metals 
at low temperatures. The phonons in a metal are to be regarded as lattice motions. 
The accompanying electronic adjustment for lattice phonon motion is exactly 
similar to that for lattice zero-point oscillations. The electronic system is still in 
its ground state. 

We have already mentioned the possibility of translational motion of the entire 


electronic system with respect to the lattice. This requires, of course, that the 


superconductor be appropriately connected for current flow. There is also the 
possibility of nonuniform flow, provided the flow rate varies only slowly. 

Since all density oscillations are of high frequency and high energy, we may 
conclude that, as long as all electrons are paired, the only low-energy motion is the 
translational flow just mentioned. This is such an important point, however, that 
we shall attempt to show more explicitly the absence of other low-energy states. 





PHYSICS: K. 8S. PITZER Proc. N. A. S. 


By complete electron pairing we mean, of course, that there are equal numbers of 
electrons with positive and with negative spin. Moreover, we expect that the 
wave function will be large only for electron locations which constitute an es- 


sentially uniform distribution of electrons of each spin as well as uniform distribu- 


tion of total electronic charge. 

Now let us suppose that an electron of positive spin moves slowly from a region 
A to another region B. In order to maintain uniform charge density (in the ab- 
sence of net flow), another electron must be transferred from B to A. If the 
second electron also has positive spin, we do not have an experimentally distinguish- 
able state. We have merely the exchange of two electrons, which leaves the wave 
function unchanged except for a reversal of sign. This is just the property of the 
ground state; we have no new state. 

The other possibility is that the charge compensation occurs through the motion 
from B to A of an electron of negative spin. Now we have in region B an excess 
of two positive spins and in region A an excess of two negative spins. Provided 
that regions A and B are sufficiently far apart, this is clearly a new state. Its 
wave function can be orthogonal to that of the ground state. But its energy will 
exceed that of the normal state by approximately twice the energy of spin unpairing 
together with any excess electronic kinetic energy. Consequently it is not a low- 
energy state in the sense of having lower energy than that required to unpair spins. 

It is difficult to predict whether there will be states of the type just described 
where the energy is not sufficient to “dissociate” the regions A and B of excess 
positive and negative spin. However, detailed consideration indicates that the 
energies of such states almost certainly must exceed that of a state with a single 
pair of electrons of parallel spin. Consequently this question is of secondary 
importance. 

We now conclude that in the first approximation the low-energy excited elec- 
tronic states (other than those for current flow) may be described in terms of un- 
paired electron spins. If the spin of one electron is reversed, then in the region 
immediately around it there will be an excess of two electrons of a given spin. 
These two electrons may then separate to form two regions with single excess spin. 
Each region may move through the lattice. Thus the location and motion of each 
region of excess spin comprise observable features which can characterize a large 
array of excited states. 

The momentum associated with such excitations can interact with the lattice 
phonons of appropriate wave length. Thus the electronic specific heat of the 
superconductor will be in thermal equilibrium with the lattice. On the other 
hand, there does not appear to be any mechanism whereby lattice oscillations can 
directly create such electron-spin excitations. They must be created at a surface 
or an interior defect or may be transferred into a superconductive region from a 
nonsuperconductive region of the metal. This picture is in accord, in general 
features at least, with the observed facts on the kinetics of the phase transition in 
superconductivity.’ 

The superconductive properties arise because the infinitesimal energy quanta of 
translation of the entire electronic system are unable to excite either phonons or 
electron excitations of the spin-unpairing type. For electronic motion to excite 
phonons the force exerted on the lattice atoms must vary from atom to atom with 
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appropriate wave length. This flow of the entire electronic system will exert a 
uniform nonoscillating force and hence cannot excite phonons. In order to create 
new electron-spin excitations, a larger energy is required than a single quantum of 
translation can provide, unless the velocity of the general motion is high. The final 
possible source of resistance is a transfer of energy of general electron motion to the 
existing electron-spin excitations. Here, as in the case of the phonons, the force 
on the excitation must vary in space, whereas the general electron motion yields 
only a uniform force. These energy-transfer prohibitions are similar to those 
operating for the superflow in liquid He 1. 

We conclude that there is no mechanism for transfer of the energy of a slow 
motion of the entire electronic system. Thus we have the infinite conductivity 
without any conflict with the Bloch theorem, which requires the state with current 
flow to have higher energy than the state of zero current. 

We have already noted the proof by Bardeen? that an energy-level system of 
this type will show the Meissner effect. 

Recently Corak and collaborators” showed that the electronic heat capacity of 
superconductors at temperatures below follows 7’, the exponential equation 


Us ¥T ac sist i (3) 


where the constants a and b have values near 9.17 and 1.50, respectively, for several 


metals. The energy-level pattern given by our theory yields just this type of 
equation with « = bkT,. This is evidence for our earlier assumption that the 


transition temperature is proportional to the energy of excitation e. 

Since our energy term e = kT, depends primarily on conduction electron—lattice 
interactions which are the cause of resistance in the normal metal, we may conclude 
that 7’, will be roughly proportional to the resistance for metals with equal densities 
of conduction electrons. While superconductors generally have high resistance in 
the normal state, one cannot claim that any such quantitative relationship holds. 
The variety of complicating conditions, however, is such that lack of quantitative 
agreement is not surprising. 

We should note that, in contrast to the proposals of Frohlich and Bardeen,'! our 
theory gives no minimum magnitude of lattice interaction below which the metal 
will fail to become superconducting even at 0° K. We predict merely that 7’, 
will be much lower for metals with small lattice interaction. However, there is 
another source of interference with superconductivity which will prevent some 
metals from attaining that state even at absolute zero. This is atomic magnetic 
moment from inner electron shells or possibly even nuclear magnetic moment. If 
the effective magnetic field from this source exceeds the critical field for super- 
conductivity, then no superconductivity occurs. This effect eliminates ferro- 
magnetic (also probably antiferromagnetic) metals from the list of possible super- 
conductors. Since the critical field Ho is roughly proportional to « (and to 7), 
even the nuclear magnetic moment might prevent superconductivity in cases where 
eis very small. However, tests for superconductivity have not been pushed to low 
enough temperatures and external fields to make the nuclear moment an important 
criterion yet. 

We believe that we have here the basis for a theory of superconductivity. It 
yields the observed properties of the superconductive state near 0° K. At a later 
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date we hope to discuss the nature of the transition to the normal state and, if 
possible, to add some more quantitative aspects to the general theory. 

The writer wishes to thank the Guggenheim Foundation for a fellowship and 
Oxford University for its hospitality during the period this paper was being pre- 


pared. 
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CONVERGENT APPROACH TO THE THEORY OF ELEMENTARY 
PARTICLE INTERACTIONS* 


By I. E. Seean 
UNIVERSITY OF CHICAGO 
Communicated by S. Chandrasekkar, July 13, 1956 


1. We sketch here an abstract theory whose results are as yet only qualitative 
but which is nevertheless encouraging in view of the present state of knowledge 
about elementary particles. It appears that a logically complete and funda- 
mentally simple theory may be based on the formulation of elementary particles 
essentially as group representations and that mathematical divergences may be 
eliminated simultaneously with physically nonoperational features. 

2. More specifically, we first indicate how a square-integrable invariant (tensor) 
of the representations defining a set of elementary particles determines, by means 
of the duality transform,' a convergent S-operator for their coupled fields. This 
establishes the existence of nontrivial interactions satisfying the covariance criterion 
of an earlier paper,” but in the realistically crucial case of a bounded invariant our 


procedure is applicable only in a formal sense. It develops, in fact, that in general 


(in particular for linearly coupled bosons with infinitely many states) there can 
exist no actual operator on Hilbert space having the formal (invariance and func- 
tional dependence) properties appropriate to the S-operator. We therefore turn 
to a mathematically modified, although symbolically equivalent, formulation and 
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indicate how a satisfactory theory may be determined by an invariant of the 
generally relevant type. 

The latter approach exhibits in convergent form typical features of “subtraction”’ 
physics. Only those self-adjoint operators A satisfying a certain limit condition 
allowing a meaning to be given to S~'AS correspond to experimental observables. 
This condition excludes in general the total number or momentum of all particles 
of a certain type but includes in significant cases occupation numbers and the mo- 
mentum of particles in individual states. 

On the other hand, standard relativistic radiation theory cannot be made com- 
pletely convergent in this fashion, because the trilinear invariant of the unitary 
representations associated with the Maxwell photon and the Dirac electron is only 
infinitesimal, within the framework of abstract representation theory. As a re- 
sult, these representations with the usual interaction can be formulated within a 
closed system only if a continuum of other kinds of elementary particles is included. 
It seems more likely for a variety of reasons* that what are really involved are 
slightly modified representations of a slightly different group admitting a proper 
invariant, 

3. It is no essential loss of generality to treat the interaction of several, or even 
continuously many, fields as a coupled boson-fermion field, all the elementary 
bosons (resp., fermions) being combined into a “‘universal”’ unitary representation 
U, on (the Hilbert space) H, (resp., U, on H,), whose spectrum‘ consists of the ele- 
mentary particles in question. We assume given (mathematical) conjugations 
J, (resp., J,) that commute with the respective representation operators and play 
the role of fundamental symmetries.* The state space of the field then consists 
of the Hilbert space K = K, @ K,, where K, denotes the Hilbert space of all square- 
integrable covariant symmetric tensors over H,, and K, is correspondingly related 
to skew-symmetric tensors over H,. Now let w be a tensor in A that is invariant 
under the induced action a ~ I'(a) of the basic group G as well as the induced 
action of the fundamental conjugation. The corresponding S-operator is defined 
as S = D~' exp (iLp.~)D, where D is the direct product of the duality transforms, 
from K, and K, to the spaces of square-integrable functionals over the real sub- 
spaces of H, and H, left fixed by the conjugations /, and J,, with respect to the 
canonical normal and Clifford distributions, respectively,! while L, denotes the 
operation of left multiplication by u. The observable A’ of the interacting fields 
corresponding to the free-field observable A is then defined as S’ = S~'AS. The 
invariance properties of w are reflected in the commutativity of S with the ['(a) and 
in its unitary character. It is immediate that the theory is covariant: [I'(a)~! 
AT(a)|' = T(a)—' AT (@). 

!. There is an essentially well-known mutual correspondence between invari- 


ants of representations and special structure of their Kronecker products, but in 


the case of infinite-dimensional representations the tensor that arises from such a 
special structure need not be ‘‘square-integrable” but only “bounded.” This is 
associated with the nonexistence of a true S-operator for the corresponding fields. 

The relevant convergence questions are well represented by the case of linearly 
coupled bosons defined by equivalent representations. Let V be the representa- 
tion on the space M defining one of the bosons, and assume that V is irreducible 
and neutral. It is convenient to use the frame of reference provided by the duality 
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transform, according to which the state space of the coupled fields may be taken as 
the space Lo(M’ @ M’, n) of functionals f(x, y) of the variables x and y, which 
vary independently over the real subspace M’ of M left fixed by the fundamental 
conjugation J, that are square-integrable with respect to the isotropic centered 
normal distribution n. The induced action of the group is then f(x, y) ~ f(V(a)~!2, 
V(a)~'y), where a is arbitrary in G, and the induced action of the fundamental 
conjugation is ordinary complex conjugation, f(x, y) ~ f(a, y). The only bilinear 
invariant is, as a function on M @ M, the inner product u+Jv, and when M is finite- 
dimensional, the corresponding S-operator is simply the operation of multiplica- 
tion by e™*: f(x, y) > e™ f(x, y). When M is infinite-dimensional, e'*” remains 
a well-defined function on M’ @ M’, but it is not a measurable functional with respect 
to the normal distribution, and the indicated transformation does not exist as an 
operator on Hilbert space. 

5. The meaningless character of the inner product as a measurable functional 
(or random variable) arises from the circumstances that x2 and y are both normally 
and independently distributed; when either x or y is fixed, x-y is a measurable 
functional of the other variable. This may be seen by writing x-y in terms of an 
orthonormal basis as xy; + xeijo +... ; here x1, 2, ... and y, yo, ... are two se- 
quences of independently distributed normal random variables of zero mean and 
unit variance. It is then apparent from the theory of sequences of independent 
random variables that the indicated series converges in no reasonable analytical 
sense. 

Now this might not be completely conclusive as regards the nonexistence of the 
S-operator, as conceivably a matter of interpretation is involved. But any inter- 
pretation must satisfy the following formal conditions: (a) invariance under the 
orthogonal group on 7’, or at least under the subgroup consisting of the representa- 
tion operators (a); (b) functional dependence on the operations of multiplication 
by the x; and y,, and so at least commutativity with all operators commuting 
with the z;and y,. It is a fact that no such operator exists, under relatively general 
assumptions about the basic group G and the representation V; e.g., it suffices that 
V be irreducible on a noncompact simple subgroup, such as the Lorentz group.’ 

6. Although there is no S-operator, there is an entirely finite S-matrix, which, 
however, is neither unitary nor proportional to a unitary matrix. To see this, 
take as an orthonormal basis for the state space of the field in the duality frame the 
functionals fm, n(@, ¥) = fmay(X) fray(®) fm) (®) frie (x) . . . describing the state in 
which there are present m(j) bosons of the first kind with wave function e,;, and n(j) 
bosons of the second kind with wave function e;, where ¢, é:, . . . is an orthonormal 
basis for M’, and the m(j) and n(j) are non-negative integers of which all but a 
finite number vanish. Putting S, for the operation of multiplication by exp 

, 
(i >> xyy,), it is readily verified that the relative transition probability amplitudes 
k=1 
(Sif; n*tm’s n')/(S:fo, orfo, 0) converge to finite values A», n,m’) n° aS Tr > ©. One 


readily computes, in fact, Am, nym’; n’ = Il Amie, nies mies ne, Where the djjem are 
© k=1 

° . . . k oe 1/ 
given by the generating function > aijmptq’s*t™/(iljtkim!)°/? = exp [(p? + 


ij,k.m=1 


q+ s? + 0)/4 — (sp + iqt)/2). The use of generating functions shows that 
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Dy | Aims ne m’s nt |2 is divergent in case m’ and n’ describe the states in which pre- 


mn 
cisely one boson of wave function ¢,, of either kind, is present. Thus the S-matrix 
is not proportional to any unitary matrix,* a fact which is also deducible from sym- 
metry considerations. 

7. What is primarily significant, however, is the existence of self-adjoint oper- 
ators on Hilbert space corresponding to the formal expressions S~!AS, for oper- 
ators A corresponding to experimentally crucial observables. In particular, the 
canonical P’s and Q’s, the occupation numbers of individual states (or of finite- 
dimensional manifolds), the momenta of each of a dense set of individual states (or 
of corresponding finite-dimensional manifolds), are represented by well-defined 
operators for the coupled fields. Specifically, we may define a bounded operator 
A on K asa (convergent, possibly complex) observable if the strong limit of Sp--!AS-p- 
exists, as the finite-dimensional subspace F’ of M’ increases to M’. The observ- 
ability of an unbounded self-adjoint operator A is reduced to the case of bounded 
operators by requiring that lim Sp! exp (7tA)Sp exist and yield a con- 

Pinas 
tinuous one-parameter group, — © <t< ™. 

Consider the canonical field operators P; and Q, for the bosons of the first kind. 
The corresponding operators P;’ and Q,’ for the coupled fields may be defined as 
follows. The problem of dealing with exp (7¢P;’) reduces, by virtue of the independ- 
ence of Sp! exp (7tP;)Sph of F’ for large F’ and any fixed tame function h, to the 
proof of the isometry of the action of the limit on tame functions. This, in turn, 
reduces as in the case of a similar problem in an earlier paper! to a straightforward 
one-dimensional computation. In the case of the Q’s there is no change; Q,’ = 


Q,. The P,;’ and Q,’ satisfy the same commutation relations as the P; and Q,, but 
it may be shown that there exists no unitary operator on K that transforms the 


P, and Q, into the P;’ and Q,’, showing in still another way the nonexistence of an 
appropriate S-operator. An argument similar to the foregoing shows the observ- 
ability of occupation numbers of individual states. 

For any self-adjoint generator R of G, dI'(R), where dl denotes the infinitesimal 
representation determined by IT’, may be called the total free-field R-momentum, 
using ‘‘momentum” in a generalized sense to cover any dynamical variable in the 
Lie algebra defining the group in question. The total free-field R-momentum of 
the bosons of the first kind with wave functions in F may be defined correspond- 
ingly as d'(Ry), where Ry = prhp, and pp denotes the operation of projection of 
M onto F. The observability of this operator, for finite-dimensional F’, follows as 
above by reduction to the action on tame functionals of x-e, r-és, .. . *e,, Where 
F is contained in the complex linear subspace determined by @, é2, . . . , €n- 

On the other hand, the total population of bosons of the first kind, or their total 
momenta, are not observables in general. To show that the total population is 
not observable, it suffices to show that, for some ¢, the transform under S,- of the 
field action induced by the transformation « > e“x, y —~ y, does not converge. 
Taking t = 7, the field action is the Wiener-Plancherel transform with respect to 
the variable x, and the action on the function that is identically one is readily com- 
puted and seen not to converge. Now turning to the case of the total F-momen- 
tum of the bosons of the first kind, S»y--!T (exp (7tR))S,p- takes f(x, y) into the 
product of f(V(exp (itR»))~!x, y) with exp [7(2 — V(exp (itRp))~'x)+y], where F 
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is the complex extension of /'’. Now as F’ + M’, V(exp (ttRy)) > V(exp (2tR)) 
strongly, and the question reduces in part to the measurability of T’x-y as a func- 
tion of x and y for the bounded normal operator 7’ V(exp (itR)) — I. This 


requires that 7 be completely continuous,’ so that the total R-momentum is ob- 


servable only when V(exp (7tR)) — J is a completely continuous operator, — © < 
i < , an unrealistically severe restriction and an impossible one if, e.g., @G is an 
open simple Lie group. 

8. In the case of higher-order interactions the same general procedure applies, 
but there tend to be fewer observable operators, and there are greater mathematical 
difficulties in their determination. In the situation of Section 3, any finite-dimen- 


, 


sional real subspaces F,,’ and F,’ of H, and H, determine as in the case of linearly 
coupled bosons a square-integrable invariant, which converges to the given bounded 
invariant as the finite-dimensional subspaces become arbitrarily great. The ob- 
servable operators are those whose transforms via the unitary operators S»p,-, ¢,’ 
deduced from the approximating tensors are convergent as F’,’ > H,’ and F,,’ > H,’. 

In the important special case of the linear-bilinear coupling between a neutral 
boson and a charged fermion, a covariant coupling formally similar to that in 
quantum electrodynamics exists (resp., is unique) if and only if the boson repre- 
sentation is contained discretely (resp., essentially once) in the skew-symmetric 
Kronecker square of the fermion representation. As is suggested by the divergent 
character of the usual theory of the coupled photon-electron fields, this condition is 
violated in the case of the Maxwell photon and the Dirac electron, and, in fact, 
the Lorentz group cannot be used as a basis for a realistic theory of the present type, 
as the Kronecker products of its relevant representations decompose continuously.® 
On the other hand, there exist other continuous groups having discrete series of ir- 
reducible representations, whose Kronecker products decompose discretely in part 
and which possess representations approximating in a sense those of the Maxwell 
photon and the Dirac electron. The present situation in the classification of ele- 
mentary particles gives an additional indication for a group close to, but distinct 
from, the usual extension of the Lorentz group, and, in particular, isotopic spin may 
well be explainable in terms of a new outer automorphism analogous to the usual 
reversal operations. 

9. We mention briefly certain formal matters that are not directly pertinent to 
convergence questions but are needed in the physical interpretation of the fore- 
going. The use of the canonical conjugation involves the following further axioms: 
(1) only those operators that commute with the canonical conjugation may repre- 
sent observables; (2) the observable physical momenta are not precisely the gener- 
ators of G but rather the self-adjoint products of such generators with the sign of 
the generator of “translation in time.”” These assumptions are consistent with 
the fundamental requirements of quantum phenomenology.’ They insure the 
positivity of the energy and eliminate in a covariant fashion the need for hole 
theory. As special consequences, the parasitic Lorentz momenta are eliminated 
as observables, and the central element of order 2 in the universal covering group 
of the Lorentz group (which should persist in any more exact theory) gives a uni- 
versal elementary charge operator which clarifies the principle of conservation of 
charge. 
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The automorphism group of the basic group must also be represented by trans- 
formations on the one-particle space, and this gives rise to some additional ele- 
ments of structure in the definition of elementary particles, which, however, do not 
affect quantization procedures. The use of the fundamental conjugation makes it 
possible to insure that the outer automorphisms, which include the usual reversal 
operations, are represented by unitary operators rather than in part by anti- 
unitary ones. 

10. It is probably premature to speculate on the structure of the system of all 
elementary particles, but it should be illuminating to give a simple example of a 
mathematically conceivable form for it. Let G be an arbitrary continuous group, 
and take as the measure on G the essentially unique left-translation-invariant one. 
Let H, denote the space of all square-integrable skew-symmetric functions g(a, y) = 
—qg(y, 7) on G @ G, and define U;,(a) g(x, y) = g(a~'x, a~"y). Now let H, denote 
the space of all square-integrable functions on G that transform completely in- 
equivalently to any subrepresentation of U’, under the regular representation h(.c) > 
h(a~'x), and define Uj(a) h(x) = h(a~'x) for h in Hy. The fundamental conjuga- 
tions J, and J, may be defined as ordinary complex conjugation, and representa- 
tives U,’(a@) and U,'(a) for an arbitrary automorphism a@ of G may be defined as 
follows: U,’(a) g(x, y gia"(x), a "(y))ug and U,s'(a) h(x) = h(a~(x))Xgq, 
where yu, and \, are positive constants uniquely determined by the condition that 
the operators be unitary. Then ff g(x, y) h(x) h(y) dx dy is a linear-bilinear 
invariant under the representations U, the fundamental conjugation, and the 
representations U’, and therefore gives rise to a mathematically consistent, well- 
determined, closed system of interacting elementary particles. 

An identification with a physical system depends on the labeling of the mathe- 
matical operators in terms of experimental observables, which requires further 
specification of G. The discrete spectra of U’, and U, give countable families of 
elementary particles, in which suitable pairs will form a closed subsystem formally 
analogous to the photon-electron system. The nontriviality and uniqueness of 


such couplings (and hence the classification of the elementary particles observable 


as such) would, however, depend on the specifie decomposition of the Kronecker 
products of the relevant representations of G. At present there is no clear physical 
analogue for the contribution of the continuous spectra of U, and l’, or, more rele- 
vantly, the continuous parts of the decompositions of the Kronecker products of 
the representations in the discrete series of elementary particles, but it seems 
plausible that their effect may in part resemble gravitation. The continuous 
spectrum may reasonably be expected to be of no greater significance in the treat- 
ment of systems of a small, finite number of types of particles than in atomic theory. 

* Work done in part under a contract between the Office of Naval Research and the University 
of Chicago. 

' Trans. Am. Math. Soc., 81, 106-134, 1956; Ann. Math., 63, 160-175, 1956. 

2 These PRocEEDINGS, 41, 1103-1107, 1955. We point out that the assumptions under which 
the uniqueness of coupling constants is proved in this note do not in general hold in the present 
theory, although they are valid in certain standard versions of relativistic quantum field theory. 

3 Cf., in particular, Duke Math. J., 18, 221-265, 1951, last section. 

‘ We define the ‘‘spectrum’”’ of a group representation as the collection of its irreducible con- 
stituents. This is a generalization of the notion of the spectrum of a self-adjoint operator A, 





676 PHYSIOLOGY: BRUCE AND PITTENDRIGH Proc. N. A. S 


which is equivalent to the spectrum in the present sense of the one-parameter group }e'*; — « 
t< o}. We make the analogous distinction between the discrete and the continuous spectra of a 
representation. 

5 An irreducible representation may be called “neutral” if it commutes with some conjugation, 
i.e., a transformation C such that C? = /, C(yx + y) = 7x + y for any complex scalar y, and 
Cxr-Cx = x-x. The conjugation is then unique within sign, and either one may be taken as 
fundamental. A charged representation is defined as a non-neutral representation that is uni- 
tarily equivalent to its transform under some conjugation. An essentially unique conjugation 
then arises through the inclusion of the antiparticle. In either case the representation space 
is irreducible under the combined action of the representation operators and the fundamental 
conjugation. Particles that are neither charged nor neutral have not as yet been proved to exist 
physically and may in any case be treated in the same way. 

6 We speak of bosons of the first and second kind. In the present theory a (convergent) bilin- 
ear coupling between neutral elementary bosons is possible only when their representations are 
equivalent, and it is convenient and no loss of generality to assume that the representations are 
identical. The bilinear self-interaction of a single boson field could be treated along the same 
lines. 

7 This is proved by the author in a paper submitted to a mathematics journal. 

’ Except in an essentially rhetorical sense, i.e., via an infinite constant. In a somewhat more 
rigorous sense, it is a transform of a unitary operator by an unbounded operator. 

9» Cf. G. W. Mackey, Ann. Math., 58, 193-221, 1953. 

0 Cf. Ann. Math., 48, 930-948, 1947. 


TEMPERATURE INDEPENDENCE IN A UNICELLULAR “CLOCK’’* 
By Vicror G. Brucet AND CoLINn S. PrrrENDRIGH 
DEPARTMENT OF BIOLOGY, PRINCETON UNIVERSITY 
Communicated by Maz Delbriick, June 21, 1956 


Introduction.—Endogeneous diurnal rhythms have been observed in living 
organisms of all types, from micro-organisms to mammals, and the endogenous 
nature of these rhythms has long been recognized. Recent interest in these rhythms 
has been stimulated by the investigations of Frisch,! Kramer,? Pardi and Papi,’ 4 
and others which have shown that the celestial navigation of bees, birds, and amphi- 
pods is made possible by a biological chronometer of the endogenous type. The 
implication that endogeneous rhythms serve generally as a time sense for organisms 
is reinforced by several observations of temperature independence of the period of 
these rhythms. Clearly, if the rhythm is to be an accurate time-giver in nature, 
its rate must be effectively independent of temperature within ecological limits. 
Temperature independence of the clock systems in poikilotherms has been demon- 
strated as long ago as 1932 by Wahl, using the bee. More recently, Brown and 
Webb‘ showed temperature independence of the rhythms in the crab Uca and the 
clam Venus; Pittendrigh’ found the rhythm of eclosion in Drosophila to be inde- 


pendent of temperature; and Rao* showed the same for a tidal rhythm in the mussel 
Mytilus. Recent work on persistent daily rhythms in plants (Ball and Dyke,’ 
Biinning and Leinweber"’) has also shown the same insensitivity of the period to the 


temperature regime. 
In 1948 Pohl'! showed that the single-celled motile green alga Euglena gracilis 
has an endogenous rhythm, but he did not investigate the temperature character- 
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istics of this rhythm. The present paper reports temperature independence of the 
Euglena rhythm. Thus the endogenous rhythm in Luglena exhibits the important 
characteristics of the endogenous rhythms in higher organisms without the com- 
plexity of multicellular organization. 

Nature of the Endogenous Rhythm in Euglena.—The rhythm in Euglena which 
has been investigated is a rhythm in the phototactic response. Euglena has an eye 


spot (concentration of pigment) and is photosensitive, in addition to being motile. 


The phototactic response is thus dependent on the motility and the photosensi- 
tivity, and by the method employed in the present investigation these factors can- 
not be separated. Although many factors affect the phototactic response, conditions 
may be standardized sufficiently so that the pattern of phototactic responses ob- 
tained in one day is repeatable the next day. In the present investigation a narrow 
beam of light (intercepting about 2 per cent of the culture volume) is sent through 
a suspension of algae. The beam of light falls upon a photocell, and the current 
from the photocell is recorded automatically. This beam of light serves two main 
functions. It acts as a “‘light-trap” to attract the Euglena, and it serves also as 
part of the sensing system to measure how many cells have been attracted to the 
light and how quickly they are attracted there. The beam of light, which will 
hereafter be called the “test light,” is turned on for a short time (generally for 30 
minutes) at periodic intervals (usually every 2 hours). The recorded pattern of 
the reduction in current from the photocell during the exposure of the culture to 
the test light is a measure of the aggregation of Euglena in the beam of light and 
will be referred to as the “‘phototactie response.”’ The phototactie response is thus 
measured in a population of cells and not in individuals. There is a daily rhythmi- 
cal change in the character of the Euglena phototactic response assayed in this 
way: the cells are positively phototactic by day and show little or no response at 
night. This is a true persistent daily rhythm in the sense that it endures in the 
absence of any 24-hour cycle of light or temperature. 

Materials and Methods.—Euglena gracilis, strain z, obtained from Dr. 8S. Hutner, 
of the Haskins Laboratories in New York, has been used for these experiments. 
The experiments reported here have all been done with Euglena grown in simple 
inorganic medium of the following composition: KNQO,; 0.1 per cent, MgSO,-7H.O 
0.01 per cent, Ke2HPO, 0.02 per cent, FeCl; 0.0001 per cent, CaCl, 0.0001 per cent. 
Cells in cultures grown in this medium remain viable and active for months if they 
receive sufficient light. 

All observations are made and recorded automatically, using a system patterned 
after that used by Pohl in his investigation. The apparatus is shown schematically 
in Figure 1. Approximately 10 ml. of culture are contained in a 50-mm.-diameter 
Carrel flask. A microscope lamp directly below the culture throws a vertical beam 
of light about 7 mm. in diameter (the test light) on to a Silicon Solar Cell. An 
adjustable fraction of the voltage output of the Solar Cell is recorded without ampli- 
fication on a Brown Strip Chart Electronik Potentiometer. A so-called “day 
lamp” consisting of a 4-watt fluorescent lamp is located laterally to the culture 
and provides illumination for a day-night cycle. In operation the test lamp has 
generally been turned on for 30 minutes every 2 hours, and the Chart Recorder 
only operated when the test lamp was on; thus the test records generally consist of 
half-hour “responses” at 2-hour intervals. The day lamp is always off when the 
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test lamp is on. The culture vessel is mounted in a black lucite chamber, and the 
whole assembly is kept in a light-tight, temperature-controlled cabinet. 

The nature of the test setup is such that the phototactic response of a culture 
grown and kept in strictly constant conditions cannot be tested. The only con- 
ceivable way in which this could be done would be to remove aliquots periodically 
from a culture kept in constant conditions and to test the aliquots for phototactic 


response, after which they could be discarded. This is impractical not only be- 


cause shaking affects the phototactic response but also because the rhythm is lost 
in constant light and because the cells become inactive in continuous darkness. 
Thus the rhythm of phototactic response must be studied under conditions in 
which the culture is exposed to the repeated stimuli of the test lamp. For con- 


‘ 


venience, subsequent descriptions will refer to such conditions as ‘‘constant condi- 
tions.” 

Pattern of Rhythm in Alternating Light and Darkness (LD).—Figure 2 (April 13) 
shows a typical record of phototactic responses of a culture which has been exposed 
to a twice-repeated cycle of 12 hours of the 4-watt fluorescent day lamp followed by 
12 hours of darkness with dawn at 10:00 in the morning. This cycle of day-lamp 
light and dark is referred to as ‘LD conditions.”” The test lamp was turned on for 
30 minutes at 2-hour intervals on the odd-numbered hours. The detailed nature of 
the pattern of responses depends on several factors, including the light intensity 
(and color) of the day lamp, the intensity, duration, and repetition frequency of the 
testing light, and the temperature. The period of the rhythm is, however, always 
24 hours in an LD cycle of 24-hour periodicity. 

Pattern of the Rhythm in ‘‘Constant Conditions.’’—With the day lamp continu- 
ously on (LL), no continuing rhythm has been observed. This loss of an overt 
periodicity in conditions of constant light, although not understood, is a feature 
common to many of the persistent rhythms. No study has been made of the way 
in which the Euglena rhythm is lost after transfer to LL. 

With the day lamp continuously off (DD), a rhythm does persist as shown in 
Figure 2 (April 14 and 15) which is taken from an experiment using 30-minute test 
lights. Using a shorter duration for the test-light cycle (approximately 10 min- 
utes every 2 hours), however, even the DD rhythm is rapidly lost. Thus the per- 
sistence of the phototactic rhythm in DD is evidently dependent on the cells re- 
ceiving sufficient energy via photosynthesis from the light of the test lamps. 

It is necessary to record for at least 4 or 5 days in order to get a reasonably accu- 
rate estimate of the period length in DD. It greatly simplifies the graphical repre- 
sentation of the data to record only the potentiometer reading corresponding to the 
maximum response of the Euglena during each 2-hour interval, and this seems to be 
a quite satisfactory method for determining the average period. Successive days 
are plotted one below another, with a constant displacement of the potentiometer 
scale from day to day. Any departure of the period from 24 hours thus shows up 
as a gradual shifting of the time of day at which the minimum response occurs. 
The persistence of an overt rhythm in DD with a period which is not exactly 24 
hours or an integral multiple of the period of the test lamps is evidence that we are 
dealing with an endogenous rhythm. 

Temperature Independence.—¥igures 3 and 4 show the results of DD experiments 
at different temperatures. It can be seen that the period of the rhythm at 33°, 
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26°, 23°, and 16.7° is always very close to 24 hours.'? In this sense the period of 
the rhythm is considered to be essentially temperature independent. At the lower 
temperature there is a more noticeable damping of the rhythm than at the other 
temperatures. This may be due to a more pronounced nonsynchrony of the 
cells in the culture, or it may be due to the lowered efficiency of conversion of light 
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Fig. 1.—Schematie diagram of apparatus for 
testing and recording the phototactic response 
in Bugle na. 
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Fic. 2.—Potentiometer chart record of phototactic response. The greater amplitude of the 
“responses” at the hours 11, 13, , 21 corresponds to a greater change in the voltage across 
the photocell as a result of the aggregation of the Euglena in the test beam. The test lights 
were on for 30 minutes every 2 hours. April 13, LD: The day lamp was on from hour 10 to 
hour 22. April 14 and 15, DD: The day lamp was off. Temperature 20° C. 


energy at the lower temperature. At all temperatures, however, there is a sig- 
nificant damping of the rhythm, and this renders impossible a precise quantitative 
characterization of the very small temperature dependence suggested by the data. 

Other Similarities of the Euglena Rhythm to Rhythms in Higher Organisms.— 
The endogenous rhythm exhibits characteristics other than temperature independ- 
ence which are similar to those found in higher organisms. One of these is the 
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EASTERN STANDARD TIME Fia. 3 (left).—Results of experiments 


illustrating the virtual temperature in- 
o1°3°5 FOB 6 02 dependence of the diurnal rhythm of 
| phototactic response in Huglena in DD 
$4, i aie conditions. The circled points repre- 
K ~APR 20 sent the potentiometer reading cor- 
' + , aa: . 
re. a The responding to the maximum photo- 
at —- 22 tactic response recorded each time the 
test lamp is on. 
23 The numbers on the left-hand scale 
are the potentiometer readings cor- 
24 responding to hour 22 on successive 
days. The light horizontal lines are 
15 ten units apart, and each day the scale 
is shifted ten units. The inelined 
DAY LAMP ON—; lines passing through the minimum 
ss UG Bes oe ee ee points on the response curves illus- 
trate the slight departure of the period 
from 24 hours and form the basis for 
the estimate of the period at the dif- 
ferent temperatures. Twelve hours of 
the day-lamp light was used on May 
25-26 to reset the phase of the rhythm 
as described in the text. 
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Fic. 4 (above). 
Summary of Figure 3, 
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ture independence of 
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phenomenon of resetting'* (phase shift) by a single perturbation with light. This 
is well enough exemplified in Figure 3. The day lamp was turned on for 12 hours 
on May 25-26. It is clear that this single perturbation establishes an entirely new 
phase without affecting the period. 

Other experiments show that an arhythmic population can be made rhythmic 
by a single light stimulus. This is precisely analogous to the situation found in 
Drosophila, and, as in the latter case, two possible interpretations exist and cannot 
be resolved. We are not certain whether a single stimulus initiates rhythms in 
cells or merely establishes synchrony in a population of rhythms with randomly 
distributed phases. 

Discussion.—No satisfactory physiological explanation of the remarkable fact 
of temperature-independent endogenous rhythms has been advanced. Three 
types of mechanisms have been suggested. These are as follows: (1) The diurnal 
periodicity is maintained by some uncontrolled variable with a diurnal periodicity. 
(2) The periodicity of the clock derives from some relatively temperature-inde- 
pendent physical (as contrasted with chemical) process. (3) Temperature inde- 
pendence is achieved by virtue of a temperature-compensating mechanism within 
the organism, although component parts of the ‘“clock’” may be temperature de- 
pendent. The first mechanism is unsatisfactory for those cases, such as Euglena, 


“cc 


where the periodicity in ‘constant conditions” is not exactly 24 hours. The fol- 
lowing consideration favors an interpretation in terms of a temperature-compen- 
sating mechanism. Repeated and nonrepeated temperature changes have been 
demonstrated to affect endogenous rhythms in several ways. The entrainability 
to periodic temperature cycles and the influence of single temperature shocks in 
initiating rhythms, introducing transients, or shifting the phase of a rhythm are’ 
examples which have been demonstrated in several organisms. An argument 
against the second mechanism hinges on the fact that the diurnal (24-hour) rhythm 
in Euglena persists even when the cells are growing in organic media so rapidly that 
they divide more frequently than once every 24 hours. There is implied in the 
second mechanism for achieving temperature independence the idea that the pe- 
riodicity of the “‘clock”’ is intimately tied into the physical localization of the com- 
ponents of the “clock.” The ability of the “clock” to continue running while it is 
duplicating itself is an argument against this mechanism. 

No model can be proposed for the clock in Euglena; however, we should like to 
point out that the rhythm possesses certain features which are characteristic of a 
class of periodic phenomena known in physics and mechanics as self-sustained 
relaxation oscillations. This aspect of endogenous rhythms has been discussed 
in some detail by Pittendrigh,'* and the subject is not sufficiently well explored in 
Euglena to merit further consideration here. 

Summary.—The endogenous diurnal rhythm in the phototactic response of 
Euglena has been shown to be temperature independent in the sense that the 
period of the rhythm at constant temperatures ranging from 16.7° to 33° is always 
very close to 24 hours. Attention is drawn to the fact that this most important 
feature of endogenous rhythms has now been demonstrated to occur at the uni- 
cellular level and that, in looking for a common mechanism in biological “clocks,” 
one need not go to the complexity of nervous or multicellular organization. 
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INFLUENCE OF PROTEIN AND CALCIUM ADDITIONS 
TO ALREADY ADEQUATE DIET* 
By Henry C. SHERMAN,t ConsTANCE S. Pearson, Mary E. R. BAt, 
ANDREA McCartuy, AND CAROLINE SHERMAN LANFORD 
DEPARTMENT OF CHEMISTRY, COLUMBIA UNIVERSITY, NEW YORK 27, NEW YORK 
Communicated June 25, 1956 

Laboratory-bred rats have, in repeated instances, thrived for seventy gener- 
ations or more upon a uniform diet consisting exclusively of one-sixth dried whole 
milk and five-sixths ground whole wheat with table salt and distilled water (Diet 


A, Laboratory No. 16). We emphasize “exclusively” because some readers have 


mistakenly supposed that these animals received other food occasionally. Per- 
haps it should also be emphasized that these rat families have continued to thrive 
until killed for lack of space to keep them longer under the strict laboratory con- 
trol of these experiments. Families of them are still (1956) thriving in the eighty- 
second generation—probably uniquely rigorous evidence that this basal diet meets 
all of these animals’ needs generation after generation indefinitely.' 

This basal Diet A (Laboratory No. 16) is therefore certainly adequate in the 
strictest sense of any ordinary use of the word; yet, as our previous papers have 
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shown, it is not optimal but can be improved by increasing its content of calcium 
or vitamin A or such a natural food as milk. The basal diet contains about 0.2 
per cent of calcium and about 14 per cent of protein in the air-dry mixture. 

The present paper summarizes those of our experiments in which the basal diet 
just described has been supplemented by extra protein either alone or with simul- 
taneous increase of calcium content. 

Long-Term Nutritional Experiments with Added Protein.—In the first series of 
these experiments the above-described basal diet (of which the animals ate ad 
libitum) was supplemented by the addition of 5 gm. of lean meat per capita on 
6 days of each week. This was equivalent to increasing the protein content of the 
diet from about 14 to about 24 per cent of the total solids of the supplemented diet. 
The protein-supplemented diet is designated ‘Diet 16P5.’’ Some of the findings 
of this first series of experiments have already been published.’ 

Such raising of the protein-intake level results, as in the similar experiments of 
Slonaker,’ in more rapid growth and earlier puberty. The adult body weight is 
greater, in both sexes, and the young are heavier at 28 days of age, when the diet 
contains added protein. 

The data on such of the animals as had completed their life-cycles at the time 
the earlier report was published indicated no effect on the longevity of the males 
and suggested that the added protein might shorten both the life-span and the 
“period of the prime” of the females. When account is taken of the animals still 
living when the 1950 report? was written, the final averages again strongly suggest 
that the addition of protein alone to Diet 16 shortens the length of life and the 
duration of reproductive activity of the females (Table 1). For the males, on the 
other hand, raising the protein level had, if anything, a slightly beneficial effect as 
measured by the length of life. 


TABLE 1 


FINAL RESULTS ON ANIMALS SHOWN IN TABLE 2 
oF 1950 Report 
Difference Critica 
Diet 16 Diet 16P5 with Its PE. Ratio 
Length of reproduc- 394 + 8.5 ‘ 79 + 18.0 4.4 
tive life (days) 
Length of life (days): 
Males 660 + 20 7 26.2 1 
Females 855 + 16 770 + 2% 3 28.0 3.0 


As was noted in the earlier papers, the life-history of the females after attaining 
maturity is uncertain. Some of the animals receiving the extra protein maintain 
higher records than their controls throughout, while others develop nervous in- 
stability with symptoms suggestive of calcium deficiency. Analysis of typical ani- 
mals showed that such increase of the protein level from 14 to 24 per cent of the 
total solids of the food did result in a lower calcium content of their bodies than in 
those of their controls. Most of the animals recovered spontaneously, but an 
appreciable percentage of the females died, often after miscarriage. Their ab- 
normality was undoubtedly due to their diets and might be explained in any of three 
ways: shortage of calcium at a time of rapid growth, or an imbalance between the 
intake levels of calcium and protein, or the paucity of the calcium intake may be- 
come manifest when the growth rate is accentuated by increased protein intake. 
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Or, in other words, the lowering of body calcium by protein enrichment of a low- 
calcium (minimal adequate) diet may be regarded as an effect of a protein-caleium 
imbalance or of an undue acceleration of growth by the increased protein intake or 
of both. 

The extent of this hazard to breeding females may be illustrated by the following 
comparison: Of forty females on the basal diet containing 0.2 per cent of calcium, 
three animals (7.5 per cent) died between the ages of one month (“end of infancy’’) 
and one year (early adulthood), while, of forty parallel cases of the same heredity 
and nutritional background, but receiving the supplement of lean meat as de- 
scribed above, seven animals (17.5 per cent) died during the same age period. 
Thus in these experiments the death rate of females in this age period was more than 
twice as high on the higher protein-intake level. Here the imbalance followed in- 
crease of the protein, without change in the calcium content, of the diet. Experi- 
ments were then made to determine whether this hazard could be reduced or obvi- 


ated by increasing the intake level of calcium at the same time as that of protein. 
Qualitatively, this has been found to be the case; and experiments designed to 


study quantitatively the effects of simultaneous supplementation of the diet with 
protein and calcium are reported in the next section. 

Effects of Simultaneous Enrichment with Protein and with Caleium.—In the second 
series of experiments three diets were compared: Diet 16P5, the protein-enriched 
diet of the earlier series, which had a protein level of about 24 per cent and a cal- 
cium level of about 0.2 per cent, on the dry basis; Diet 162P5, in which the protein 
level was kept at 24 per cent but the calcium level was raised to 0.4 per cent; and 
Diet 168P5, with the same protein level but still higher calcium (0.6 per cent). 

Using representative 28-day-old animals from our Diet 16 colony, and applying 
strictly the principle of littermate control, six breeding lots (each consisting of two 
males and three females) were started on each of the three diets.’ In addition, one 
breeding lot of second-generation Diet 162P5 and two of second-generation Diet 
168P5 animals were observed. 

In Table 2, summarizing the effects of the three diets, the three right-hand col- 
umns represent the average results for those cases only in which littermate parallels 
were observed, while the three left-hand columns include in the average all reason- 
ably comparable cases observed on the three diets. 

In Table 3 the comparison is between those animals which received only the meat 
supplement and all those which received supplementary calcium as well (whether 
to the 0.4 or the 0.6 per cent level). 

When calcium as well as protein is added to basal Diet A, the growth rates of 
both males and females are still further increased, and greater body weights are 
maintained throughout adulthood. These effects are clearly significant from a 
statistical point of view. 

The females given additional calcium reach adulthood at a significantly earlier 
age and, on the average, maintain the ability to bear young much longer (though the 
latter effect is not proved conclusively by the number of data at hand). The number 
of young borne, the number reared, and the average weight of the young at 28 days 
of age are slightly greater for the calcium—protein-supplemented than for the pro- 
tein-supplemented groups, but the differences are not statistically significant. 

The data suggest (though they do not prove) that the supplementary calcium 
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may increase the average length of life. As was the situation in a much earlier 
study® in which calcium was added to Diet 16, the benefit appears somewhat greater, 
and the effect more convincing, in the case of the males than in the case of breeding 
females. 


TABLE 2 


Errecrs or SuPPLEMENTED WHEAT-AND-MILK Diets CONTAINING 24 PER CENT PROTEIN 
AND 0.2, 0.4, on 0.6 Per Cent Catcium 
(Mean with Its Probable Error) 
ALL COMPARABLE CASES LITTERMATE PARALLELS ONLY 
Diet 16P5 Diet 162P5 Diet 168P5 Diet 16P5 Diet 162P5 Diet 168P5 
(0.2% Ca) 0.4% Ca) 0.6% Ca) 0.2% Ca) (0.4% Ca) 0.6% Ca) 
Males 
Gain in weight, 28-56 days 84.3 + %: + 97.222 3 $ § 96.82+2.9 98.3 
of age (gm.) : (12) (12) 
Weight at 100 days (gm.) 


Weight at 210 days (gm.) 
Weight at 365 days (gm.) 
Weight at 500 days (gm.) 
Length of life days) 


Females 
Gain in weight, 28-56 days 
of age (gm.) 
Weight at 100 days (gm.) 


Weight at 210 days (gm.) 
Weight at 365 days (gm.) 


Weight at 500 days (gm.) 228 + ¢ 
(15) 
Age at birth of first young 113.0 + ¢ 
(days) (21) 
Number of young borne 41.0+2 
(18) 
Number of young reared 34.7 
(18) 
Average weight of young at 45.4 + 
28 days of age (gm.) 
Length of reproductive life 319 + 386 + 
(days) (18) 20) 
Length of life (days) 807 + If 870 + 2 
(18) 20) 


* Figure in parentheses indicates number of cases. 
+ Numbers vary due to exclusion of pregnant females. 


Above and beyond the benefits of calcium supplementation which can be meas- 
ured objectively, the experienced observers who repeatedly handled the animals 
remarked further on the improvement which “‘is visible in their alertness and gen- 


eral appearance and palpable in their superior muscle tone and firmness of sub- 


cutaneous tissue.’”? 

As between the two higher levels of calcium studied, no consistent or significant 
differences in response could be observed, either in the appearance of the animals 
themselves or in their vital statistics. 

The Problem of Optimal Intakes.—The present findings take on added interest 
in the light of earlier experiments in which the calcium level only was increased.® ° 
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Starting with the basal Diet A, we now see that an increase either in the protein 
level (from 14 to 24 per cent) or in the calcium level (from 0.2 to 0.4 or 0.6 per cent) 
increases the rate of early gain as well as the weight of the offspring at the end of 
infancy, results in larger adult size, hastens the attainment of maturity, increases 
the number of young borne and the number reared, and extends the length of life 
of the males. The influence of added protein is somewhat more marked than the 
influence of added calcium as regards body weight and gain and somewhat less 
marked so far as the other effects are concerned. Adding both protein and calcium 
gives better growth of both sexes, and, apparently, longer life for the males, than 
adding either factor alone. 
TABLE 3 
Errects oF AppInG Catcium To Diet ConTAINING 24 Per CENT PROTEIN 
AND 0.20 PeR Cent CaLciumM: STATISTICAL 
SIGNIFICANCE OF DIFFERENCES 
Without With Difference 
Ca Suppl.* Ca Suppl.t with Its P.E 
Body weight (gm. ): 
Gain, 28-56 days of age: 
Males 84.3 96.5 
Females 62.7 72 
Weight at 100 days of : 
Males 214 250 
Females 162 179 
Weight at 210 days 
Males 293 325 
Females 204 212 
Weight at 365 days 
Males 319 352 
Females 220 243 
Weight at 500 days 
Males 308 § 9 
Females 228 + 4 
Length of life (days): 
Males 667 Mak? 
Females ‘ 807 36 + 2 
Reproductive success of females: 
Age at birth of first young (days) 113.0 ¢ 15.1 +3.65 
Length of reproductive life (days) 319 48 + 20.2 
Number of young borne 41.0 é 13+3.0 
Number of young reared 34 3.0 + 2.73 
Average weight of young at 28 days (gm. 45.4 0.6 + 0.86 


7.1 
4.2 


* Animals on Diet 16P5; see Table 2. 

¢ Includes animals on Diets 162P5 and 168P5; see Table 2. 

On the other hand, increase in the calcium intake alone extends the period of the 
prime (as measured by the duration of reproductive capacity of the females) and 
seemingly also the length of life of the females, whereas increase of the protein in- 
take alone appears to shorten both the prime of life and the length of life of the 
females. The question whether females on the high-calcium, high-protein diet 
either live longer or retain the ability to reproduce longer than females on the basal 
diet alone appears doubtful from the data at hand. A confident answer would re- 
quire comparison of littermates on the two diets, which has not been made. 

Thus it would seem that increasing only the protein intake of our Diet A (from 
14 to 24 per cent) has effects on the life-history which, while beneficial at some 
points, are nonbeneficial at other points. In order to overcome the potential haz- 
ards of the higher protein intake, the calcium intake must also be liberal. There 
seems, however, no reason to believe that the level of calcium which gives the best 
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results when the protein intake is 24 per cent is any higher than the level of caleitum 


which gives the best results when the protein intake is 14 per cent. The present 
studies seem to indicate that the most beneficial level of calcium intake when the 
protein intake is liberal lies in the range of 0.4—-0.6 per cent, or two to three times 
the level of minimal adequacy. In earlier studies,* where calcium alone was added 
to Diet A, the level of optimum intake appeared to be at least three, and perhaps 
as much as four, times the level of minimal adequacy. 

Finally, it would seem justifiable to conclude that a diet liberal both in calcium 
and in protein is clearly superior to a diet liberal only in protein. 


The authors wish to express their deep appreciation to Dr. Charles Glen King 
for having made possible the completion of this study. In particular, the junior 
authors wish to thank him for his continued support, encouragement, and guidance 
after it became necessary for the senior author to withdraw from active partici- 
pation in the research. 

* This work has been aided by grants to Columbia University, at first from the John and Mary 
Markle Foundation, more recently from the Nutrition Foundation, Inc., the Williams-Waterman 
Fund in the Research Corporation, and Swift and Company. 

+ Deceased October 7, 1955. 
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MEASUREMENT OF IMPOSED VOLTAGE GRADIENT ADEQUATE 
TO MODULATE NEURONAL FIRING* 


By C. A. TerzuoLo anp T. H. Buttock 
DEPARTMENT OF ZOOLOGY, UNIVERSITY OF CALIFORNIA, LOS ANGELES 
Communicated by H. W. Magoun, July 6, 1956 


Many authors'~*! have described the effects of polarization by imposed electric 
current upon nerve cells. We have not seen in the literature, however, a quanti- 
tative evaluation of the sensitivity of nerve cells to electric fields in terms of volt- 
age gradient across some appropriate dimension of the neuron. We have under- 
taken to estimate the threshold value as being the unique value of greatest interest 
and have found this to be far lower for modulation of the frequency of an already 
active neuron than for the excitation of a silent one. 
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In selecting the measurement to be preferred for evaluating such an electric field, 
the estimation of voltage drop across some chosen region is a second choice to the 
estimation of current density through a known strategic membrane; but since we 
do not know how to measure the latter, we are forced to this choice. Furthermore, 
the selection of structures across which to measure voltage drop presents diffi- 
culties. At first glance it may appear that the structure of choice would be the 
cell membrane and that the inside-outside potential should be the most significant 
one to measure before and during threshold polarization. Upon further con- 
sideration, however, it will be realized that there will be no characteristic value for 
this membrane potential change, since in an equatorial region of the cell, with re- 
spect to the axis of polarization, the potential across the membrane will not be 
changed at all during polarization, and on one side of this line it will be increased 
and on the other side decreased. It will be shown, moreover, that the maximal 
values to be expected near the poles will be virtually beyond the limits of the sensi- 
tivity of the high impedance electrode wide-band amplifier systems we have used. 


As an approximation, therefore, since we must measure the potential between two 
points rather than between the whole dendritic surface and the axon hillock, for 
example, we have measured the gradient across the whole soma in the external field, 


in an axis of polarization shown experimentally to be the most effective. 
MATERIALS AND METHODS 


The requirements for a suitable preparation for the above measurement are as 
follows. An accessible neuron, whose activity can be followed continuously with 
assurance that this activity originates in the cell under measurement, must be 
active at a steady level, so that alteration of its frequency by the experimental con- 
dition will be noticeable. In addition, we must be able to exclude the possibility 
that the effects obtained are exerted through presynaptic alterations in other neu- 
rons. An ideal preparation which satisfies these conditions is the nonadapting 
stretch receptor of Crustacea, described by Alexandrowicz*? and studied physio- 
logically by Wiersma, Furshpan, and Florey,** Kuffler,*4 and Eyzaguirre and Kuf- 
fler.*~-*" This is a sensory neuron whose cell body lies in the periphery, close to a 
special muscle fiber bundle (Florey and Florey**) to which it is connected by sev- 
eral dendrites emerging from one side, while from the other side of the soma an 
axon takes origin to run to the central nervous system. Under steady stretch, 
this cell maintains a steady discharge of impulses at a certain frequency, and this 
discharge is highly rhythmic, so that very small alterations in its frequency are 
conspicuous. 

We have used the abdominal receptors in the crayfish (Cambarus clarkii). These 
were prepared under a microscope, and the muscle bundle of the nonadapting re- 
ceptor only was mounted in a device similar to that described by Eyzaguirre and 
Kuffler.** The companion receptor, which is a thicker-fibered and adapting one, 
was usually cut off. The isolated preparation was suspended in a plastic box per- 
mitting dark-field illumination and was so positioned that the soma, dendrites, and 
muscle bundle were in a saline solution (Van Harreveld*’), with the sensory axon 
rising into a layer of mineral oil, where recording electrodes picked up the discharge 
of impulses. Polarizing currents were applied through Ag-AgCl electrodes in con- 
tact with the preparation in different positions. In one series of experiments a 
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device was used which consisted of bar polarizing electrodes maintained at a con- 
stant separation and so mounted as to permit rotation with respect to the prepa- 
ration (suspended from the paired forceps and recording electrodes). These bar 
electrodes were 1.5 cm. long and 2.5 em. apart and provided in the central region 
occupied by the preparation a uniform field in one plane. 

The voltage gradient was measured by micropipettes filled with 3 .M KCl placed 
close to the cell at opposite sides in the plane of polarization. The cell is typically 
10-60 uw long in the axono-dendritie axis, and the electrodes were typically not 
closer than 10 uv to the cell, no attempt being made to place them exactly in con- 
tact with the cell surface. Their tips could be seen, and the distance between them 
measured with a micrometer. A push-pull cathode follower input was used in con- 
junction with a Grass P-6/DC amplifier. This, together with a Dumont Type 322 
cathode-ray oscilloscope, provided maximal amplification of 1 millivolt per inch. 
The nerve impulses led from the axon were monitored through a separate condenser- 
coupled amplifier. A galvanometer in series with the polarizing circuit measured 
the intensity of the imposed current, supplied by a battery and regulated with a 
potentiometer. 

A second preparation used was the cardiac ganglion of the lobster (Panulirus 
interruptus). The ganglion was isolated completely, except for the posterior at- 
tachments to the myocardium, by which it remained suspended across a hole dis- 
sected in the myocardium. The preparation was immersed in mineral oil. The 
polarizing electrodes were applied to the anterior and posterior ends of the ganglion, 
and a pair of silver wires placed in the middle of the ganglion, separated longitudi- 
nally in the axis of polarization, provided both monitoring of the spontaneous rhyth- 
mical electrical discharge of the ganglion cells and recording of the voltage drop 
during the imposed polarization. The distance between these electrodes was 
measured with the ocular micrometer. This ganglion contains only five large neu- 
rons and four small ones (Alexandrowicz,“ Maynard,‘': 4? Hagiwara and Bul- 
lock**: #4). In this case we could not place the electrodes with reference to a cer- 
tain ganglion cell whose activity was identified, and we could not be sure that our 
estimate of threshold was the most sensitive or that the effect seen was not exerted 


over some small distance in the neuropile. 
RESULTS 


Comparative Effects of Imposed Current on Active and Inactive Neurons.—Both 
slow- and fast-adapting receptors, isolated from the same segment of the crayfish, 
were used in these experiments, and the preparations were completely immersed in 
mineral oil. A continuous stretch was applied to the muscle fibers, such that the 
nonadapting receptor was rhythmically firing between 5 and 15 impulses per sec- 
ond, while the adapting receptor was silent but was easily made to fire by a small 
increase in stretch. Polarizing electrodes were applied, one on the muscle bundles 
and the second on the afferent nerve close to the cell bodies. In this condition 
3.6 X 10-* ampere was sufficient to change the frequency of firing of the non- 
adapting receptor by 5-10 per cent. A current of 6-6.5 X 10-8 ampere changed 
the frequency about 35 per cent, and 1-1.2 X 10~7 ampere sufficed to alter the fre- 
quency 65-72 per cent. None of these currents induced the adapting receptor to 
discharge. Increasing the current to more than 20 times the threshold value for 
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acceleration of the already active neuron was still not sufficient to excite the inactive 
neuron. But still stronger currents did produce transient bursts therein. The 
directions of polarization to produce inhibition and acceleration were respectively 
cathode and anode on the nerve. 

Voltage Gradient across the Soma Adequate To Modulate Frequency of an Already 
Active Neuron.—Using the nonadapting receptor immersed in saline, the voltage 
gradients measured between two microelectrodes placed close to the origin of the 
axon and close to the muscle bundle, respectively, in the axis of polarization, were 
in the most favorable cases 0.08—0.12 millivolt for effects between 5 and 25 per cent; 
0.15-0.2 millivolt for effects of about 30 per cent; 0.2-0.4 millivolt for effects of 
more than 100 per cent (in the case of inhibition, a complete inhibition followed by 
rebound). These cells were ca. 60-80 uw in this dimension. The effective polari- 
ties were the same as those just given. 

Preferential Axis of Polarization.—These values of voltage gradient were all ob- 
tained in the best axis of polarization of the neuron. When the field was rotated, 
a significant increase of the applied current was necessary in order to reproduce the 
same effect as that obtained in the axono-dendritie axis. At 25° of rotation, from 
0 to 30 per cent more current was necessary. At 45°, from 1'/, up to 2!/, times as 
much current was necessary in most cases. At 90° rotation, in all experiments, the 
current needed to produce the same effect was from 1'/, up to 4 times as great. 
In a few cases, at 45° and 90° of rotation the polarities for acceleration and in- 


hibition were reversed compared to those effective in the best axis. Control ex- 
periments and measurements showed that rotation did not change the threshold 


of current intensity by altering the position of the preparation in the field between 
polarizing electrodes. Changing the position of the cell without rotation did not 
alter the threshold current, and measuring the field showed a linear curve of voltage 
drop against interelectrode distance until the recording electrodes approached the 
polarizing electrodes within 2 mm., when a steep drop occurred. 

Measurements in a Population of Neurons.—In the cardiac ganglion of the lob- 
ster, currents as low as 4.3 X 10-7 ampere are able to inhibit or accelerate by 10 per 
cent the frequency of the rhythmic bursts corresponding to heart beat. We did 
not attempt in these experiments to identify the activity of a particular one of the 
nine ganglion cells and measure the potential drop across it. But it may be as- 
sumed that the gradient measured between any pair of points in the ganglion sepa- 
rated by 100 u will be approximately the same as between another pair of points of 
the same total cross-sectional area, since the impedance of the tissue is very low, 
and we did not encounter evidence of gross inhomogeneity in resistance. In the 
most favorable cases, the threshold voltage drop across 100 w was 0.08-0.2 milli- 
volt. The large ganglion cells are of the order of 40-60 y in diameter of the cell 
body. Only the one axis of polarization was tried in this case, but the anatomy of 
the ganglion shows a predominance of orientation of cells in the long axis of the 
ganglion. The polarity, in agreement with Maynard,” was anterior anodale for 
inhibition and vice versa. 


DISCUSSION 


The results indicate that the output of neurons, provided that they are firing 
autorhythmically or presumably also if they are firing at a steady rate as a conse- 
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quence of a constant input, can be easily modulated in frequency by very low volt- 
age gradients (ca. 0.1 mv/100 «) measured in the external field across the whole 
soma, as compared to the voltage gradient necessary to stimulate a silent neuron. 
It will be clear in the evaluation of the values given that since they are measured 
across the maximum dimension, they are the maximum possible values across an 
effective dimension in the most effective axis of polarization. The voltage drop 
actually responsible for the modulation of firing frequencies can only be lower. 
This would be true particularly in the case in which a strategic portion of the mem- 
brane is of critical importance for the initiation of the discharge, since the value of 
the voltage gradient acting would be lower than that here measured in proportion 
to the smaller dimensions over which it acts, the total field in which the neuron lies 
being a relatively good conductor compared to the cell with its high-resistance 
membrane. 

Aside from this consideration, it is obvious that even if the voltage gradient is as 
great as 0.1 mv/100 yu, the transmembrane potential change, resulting from the 
application of the polarizing current, will be smaller, something less than half of 
this in the maximum case. Even that value can obtain only at or near the poles 
of the cell, and at these two poles the effect on the resting membrane potential will 
be of opposite sign for a given direction of polarizing current. In the equatorial 
region of the cell, with respect to the axis of polarization, there will be no change in 
the membrane potential. 

It is difficult to understand, on contemporary conventional theories of the deter- 
mination of neuronal firing, how such a change in the membrane potential can in- 
fluence the frequency of firing. Not only is it small, but there is in fact no change in 
the membrane potential of the neuron describable by a single term. Only by as- 
suming an exceedingly critical firing level in voltage and, additionally, a limited 
portion of the neuron near one of the poles as being the region whose membrane 
potential is significant in determining firing, can the effect of the applied current be 


interpreted as acting through subtraction or addition from the slowly rising de- 
polarization leading to each impulse. Of course, the critical polar region of the 


cell need not be a point but can be an extensive area of membrane, since the equator 
under consideration is not a geometrical but an electrical one and may be located at 
an anatomically quite polar latitude, depending on the area and the specific resist- 
ance of the membrane on each side of it. The difficulty with this hypothesis is 
that it requires that the very low-resistance axoplasm be at significantly different 
potentials in different parts of the interior of the neuron, a possibility which is cer- 
tainly not excluded but for which we do not have independent evidence. Further- 
more, the magnitude of the changes in steady depolarization, commonly associated 
with the control of firing, are of quite a different order of magnitude.“-*® The 
same is true of the values of both synaptic inhibitory and excitatory potentials, 
which are several millivolts in amplitude, while the potentials here considered must 
be not greater than some tens of microvolts. 

An alternative hypothesis may be that the imposed currents are acting by creat- 
ing or increasing an already present gradient between different portions of the cell 
membrane, i.e., that the critical difference is not the change in the transmembrane 
potential but the change in the potential between one point on the surface and an- 
other. These possibilities are not strictly alternative or mutually exclusive, and 
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they suggest two new or little-considered parameters of neuronal state critically 
significant in determining the activity of poised or already active cells over and 
above other conditions which alone may be quite adequate to fire the neuron, e.g., 
specific chemicals or strong electric currents. 

The great sensitivity of neurons to small voltage differences tends to support the 
views already suggested to the effect that electric field actions can play a role in 
the determination of probability of firing of units in ganglionic masses within the 
central nervous system,**~°* at least in synchronizing cells,® **—® and that currents 
adequate to exert an effect are physiologically available.’ * 1% %~74 

If this is true, the considerations adduced here emphasize the significance of the 
morphology of the cell, since the geometric relations between the portions of the 
cell or dendritic membrane specially sensitive to the transmembrane changes, or the 
regions sensitive to difference in potential between different points on the surface 
with relation to the external field, acquire an overriding importance. Even more, 
the view here expressed emphasizes the importance of the architectural arrange- 
ments of groups of neurons and of their complex dendritic ramifications, which will 
be, as it were, antennae oriented with respect to the direction of currents of physio- 
logical significance to these cell masses. If this view is correct, it would be neces- 
sary to predict that in structures such as the cerebral cortex, in which orientation of 
many neurons is not random but characteristic, these neurons must normally be 
subject to physiologically significant field effects of a relatively constant axis. 
Since this axis is probably the most effective axis for polarization effects and since 
we may guess, for example in the case of the pyramidal cells of the cerebral cortex, 
that this will be the longitudinal axis of the cell, the expectation stated means that 
we would anticipate a higher voltage gradient in the extracellular fluid consequent 
to neuronally significant events in the axis normal to the surface than tangentially. 
But different axes may be simultaneously effective, as is suggested by the specific 
geometry of the processes in the molecular layer of the cerebellar cortex. 

Finally, it may be pointed out that this view, while superficially in agreement 
with the theory of Gesell,” “ is significantly different in this respect: that the 
gradient between one part of the surface and another, which is supposed to be im- 
portant in determining neuronal firing, exists only in the already active cell or, in 
the active cell, is sensitive to changes of a different order of magnitude. The silent 
cell, even when the attempt is made to poise it close to its threshold, as in the adapt- 
ing stretch receptor, requires enormously more voltage gradient to be fired than the 
active cell does to be modulated. 


SUMMARY 


1. The voltage in the extracellular field across a single nerve cell has been meas- 
ured with microelectrodes during polarization with imposed current just sufficien‘ 
to modify the frequency of firing of an already active cell, using a preparation fron 
the nonadapting stretch receptor of the crayfish abdomen and another from the 
cardiac ganglion of the lobster. 

2. It is argued that, unless the current density can be determined across the 
strategic portion of the cell membrane, this is the most suitable measure of the in- 


tensity of imposed currents which can exert an effect on the probability of firing of 


a neuron. The transmembrane potential does not provide such a measure. 
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3. In the most effective axis of polarization, it was found that a voltage gradi- 
ent in the neighborhood of 0.1 mv/100 4 markedly influenced active cells. Cur- 
rents of more than 20 times this value are required to fire a silent cell, even if it has 
been poised, i.e., the adapting stretch receptor, under a physiological degree of 
stretch. 

t. The actual value of the critical voltage drop across the essential structure can 
only be smaller than this, and it may be much smaller. The transmembrane po- 
tential change with polarization may be from 0 to nearly one-half this value and 
is thus, at the most, several orders of magnitude smaller than the changes in mem- 
brane potential for threshold electrical stimulation or for several cases cited from the 
literature associated with alteration in activity or with inhibition or excitation. 

5. The findings may mean that exceedingly critical firing levels in voltage across 
the membrane exist which would have to be confined to a certain portion of the cell 
near one of the poles relative to the effective axis of polarization. Alternatively, it 
is supposed that the imposed current acts by creating or increasing a gradient be- 
tween different portions of the cell membrane, not across it but between one point 
on the surface and another. These possibilities are not mutually exclusive, and 


they suggest two new or little-considered parameters of neuronal state critically 


significant in determining activity over and above other conditions. 

6. The great sensitivity of neurons to small voltage differences supports the 
view that electric field actions can play a role in the determination of probability 
of firing of units in ganglionic masses, in response to physiologically available cur- 
rents. If true, this conclusion emphasizes the significance of morphology of the 
cell and of architectural arrangement of groups of neurons and their dendritic 
ramifications. 
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INTRODUCTION 

Previous reports have dealt with a nerve growth-stimulating effect of mouse 
sarcomas 180 and 37 on sensory! and sympathetic ganglia of the chick embryo.’ 
The effect consisted in a striking increase in size and cell number of sensory and 
sympathetic ganglia adjacent to the tumor and of an extensive invasion of their 
fibers into the tumor. Transplantation of the tumor onto the allantoic membrane 
had a growth-promoting effect similar to the intra-embryonic transplantations.* 4 
In the allantoic transplants the tumor was not accessible to nerve fibers; the effect 
was therefore mediated through the circulation which the tumor shared with its 
host. Embryos bearing mouse tumors in the abdominal wall or on the allantoic 
membrane differed from controls also in another respect: the viscera were over- 
loaded with sympathetic nerve fibers, whereas control embryos showed practically 
no visceral innervation at corresponding stages of development. Large nerve 
bundles were also found in blood vessels, where they formed agglomerates project- 
ing into the lumen of the vessel. The agent released by sarcomas 180 and 37 has 
therefore a dual effect: it stimulates the overgrowth of sensory and sympathetic 
ganglia and is responsible for the aberrant distribution of sympathetic nerves in the 


viscera and blood vessels. Two alternative hypotheses were offered: either the 


agent affects primarily the growth and differentiation processes of nerve cells, or 
changes of the periphery are the primary effect and the overgrowth of the ganglia 
is an indirect effect. The following experimental result favored the first hypothe- 
sis. 

Sensory and sympathetic ganglia were explanted in vitro in proximity to a frag- 
ment of mouse tumor. Under these conditions the ganglia are directly exposed to 
the tumor, and influences of the organism are excluded. Ganglia explanted to- 
gether with normal embryonic tissues of chick or mouse served as controls. 
Whereas very few nerve fibers emerged from the control ganglia in the first 24 
hours, a dense fibrillar “halo” surrounded the ganglia exposed to the tumor at the 
end of the first day.6 These experiments indicated a direct action of the neoplastic 
agent on nerve cells. The tissue-culture method was then used to test the effect 
of a cell-free homogenate of the tumor. When it was found to possess the same 
activity as the growing tumor in vitro, the investigation was directed toward the 
purification and identification of the agent.® 7 

In the course of this work another growth-stimulating agent was discovered in 
snake venoms. The properties of this agent will be dealt with in another com- 
munication.’ It is the object of the present report to describe its biological effects 
on sensory and sympathetic ganglia in vitro and in vivo and to compare them 
with those of the tumor factor. 
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EXPERIMENTS IN VITRO 
The investigation was centered on sensory ganglia of 7-day chick embryos. 


The standard hanging-drop technique was used. The medium consisted of one 


part plasma, one part synthetic medium, and one part of the snake venom to be 
assayed. We used crude venom from Crotalus adamanteus (rattlesnake) and 


Agkistrodon piscivorus (moccasin) and a protein fraction (DEAE fraction) derived 
from the moccasin venom.’ In each culture three to four ganglia were explanted. 
A total of several hundred ganglia were tested. <A series of controls was run in 
which one part of an isotonic saline solution was added to the medium instead 
of the snake-venom fraction. The cultures were incubated at 37° and examined 
after 6, 12, and 24 hours. 

Results —The addition to the medium of 7 ug/ml of the crude snake venom or of 
1 ug/ml of the DEAE fraction had identical effects. The following description 
therefore applies to the experiments performed with both. The results were in 
fact remarkably consistent, and only minor fluctuations were observed. 

Six hours after the beginning of the experiment, a dense ‘“‘halo”’ of short nerve 
fibers surrounded the ganglia. The fibers increased rapidly in length and density 
in the following hours. At 24 hours the fibrillar halo had reached its maximal 
density (Fig. 1). The characteristics of this most unusual fiber outgrowth were 
identical with those described in previous investigations dealing with mouse 
tumor; ® they will therefore not be recounted here (Figs. 3 and 4). It may only 
be mentioned that the cultures differed from controls also in another respect. 
Whereas in control cultures the fiber outgrowth was very scanty in the first 24 hours 
and the nerve fibers grew in a wavy, irregular fashion (Fig. 2), in the present ex- 
periments the fibers grew straight, radially, and showed extensive branching, 
forming a brushlike border at their tips. In most cases, a second, shorter halo 
formed in the clot between the twelfth and the twenty-fourth hour. As in the pre- 
vious experiments with tumor, very few or no spindle cells were found among the 
nerve fibers in the first 24 hours. In the following days, however, a large number of 
these cells migrated from the explant between the fibers. In the present experi- 
ments a considerable degree of liquefaction of the plasma occurred. As a result, 
a disintegration of the fibrillar halo was observed in many of the cultures after 48-72 
hours. The cultures were then discontinued. 

A more limited number of experiments were performed with sympathetic ganglia 
explanted from 8- to 9-day chick embryos. The results were in all respect similar 


to the ones described above. 
EXPERIMENTS IN VIVO 


It had been suggested previously that the in vitro and in vivo effects of tumors 
were produced by the same agent. However, preliminary experiments of injec- 
tions of active tumor fractions into the yolk of 7-day embryos were negative. 
We attribute this failure to the fact that the pure fraction was available only in 
very small amounts and that probably the quantities injected were considerably 
smaller than those produced continually by living tumor. The striking similarity 
between the in vitro effects of the tumor and of the snake venom, and the fact 
that the activity of the purest fraction of snake venom is 1,000 times that of the 
purest tumor fraction (on a dry-weight basis), suggested a repetition of the injection 
experiments with snake venom. 
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The effect of the venom was assayed by injecting it into the yolk of embryos 
ranging between 6 and 8 days of development. A small window was opened in the 
shell, and the fraction DEAE of the venom (containing 150 ug protein/ml together 
with 100 ug/ml of terramycin) was injected into the yolk with a 1-ml. syringe. 

In these preliminary experiments the total amount of venom injected, the stage 
of the embryos, and the length of time of the experiments were varied, in order to 
determine (a) the tolerance of the embryo, (b) the stages of development which 
are sensitive to the effect of the venom, and (c) the length of time required to secure 
an effect. 

The injections were repeated two or three times a day. A total of 120 embryos 
were injected; 17 were recovered and impregnated with silver following the Cajal 
De Castro technique. They were sectioned serially and used for the present in- 
vestigation. 

Results.—The toxicity of the venom was reflected in the severe mortality and in 
the diffuse edema which affected the surviving embryos. The results proved be- 
yond doubt that the venom possesses a growth-stimulating effect on sensory and 
sympathetic ganglia of the embryo similar in every respect to the effect of mouse 
sarcomas. The increase in size of the sympathetic system and, in particular, of its 
prevertebral chain was so striking that it overshadowed the reaction of the sensory 
ganglia. The same results were obtained in embryos bearing mouse sarcomas. 
The analysis was then focused on the sympathetic system. 

Of the 17 embryos, 11 had been injected over a period of between 36 hours and 4 
days, and 6 over a period of only 26 hours. Ten of the 11 embryos injected for a 
longer period showed a considerable increase in size of the sympathetic ganglia and 
an atypical distribution of nerve fibers in the viscera. The intensity of the effect 
varied in different embryos: it increased parallel to the length of the time the 
embryos had been exposed to the venom. A definite increase in the prevertebral 
sympathetic complex was observed in one embryo injected over a period of 36 
hours, from 6 to 7!/2 days, with a total amount of 0.5 ml. Nerve bundles emerging 
from the prevertebral complex entered the adjacent mesonephroi (Figs. 9 and 10). 
No nerves are found in the mesonephroi at corresponding stages in control embryos. 
The nine other embryos which showed a positive effect were fixed 2, 3, and 4 days 
after the beginning of the experiment. The total amounts of venom injected 
were: 0.6 ml. in one, 1.3 ml. in four, 1.8 ml. in two, and 3.0 ml. in two cases. A 
maximal effect was observed in the two embryos injected over a period of 4 days 
with 3.0 ml. and recovered at 10 days. The embryos injected for only 26 hours 
showed no effect. 

In one of the embryos injected over a period of 4 days, one ganglion of the pre- 


vertebral sympathetic complex at the level of the adrenal was measured. Each 
section was projected with the help of the camera lucida, and the contour of the 
ganglion was drawn on cardboard. The total number of sections was then weighed, 
and the weight was compared to that of the same prevertebral ganglion of a control 


embryo. 

The results gave an increase in weight of 3.2 times the control. Since the weight 
is proportional to the volume, the figure indicates a similar increase in volume. 
Estimates of cell numbers were obtained by inserting a micrometer disk in the 
ocular and counting the number of cells in 200 squares each in comparable regions 
of ganglia from normal and injected embryos, using oil immersion. The results are 
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1,066 cells in 200 squares of the injected embryo and 2,158 in the control. The 
ratio of cell number per unit area was therefore 1:2. The volume ratios were then 
divided by the cell-count ratios to obtain an estimate of the relative increase in cell 
numbers. The latter amounts to 1.6. Hence the over-all increase in size is due to 
a greater extent to cellular hypertrophy and to a lesser extent to an increase in cell 
numbers. Comparable results were obtained in embryos bearing tumors.? 

A comparison of hyperplastic and normal ganglia (Figs. 5 and 6) shows that 
the cells of the injected embryos are not only larger than the controls but also in a 
more advanced stage of differentiation. This is attested by the presence of neuro- 
fibrils in the cytoplasm of almost all the sympathetic neurons in the experimental 
cases, but only in very few of the controls (Figs. 7 and 8). Furthermore, the nerve 
bundles which emerge from the hyperplastic and hypertrophic ganglia are much 
larger than those in the controls. 


COMPARISON OF THE IN VIVO EFFECTS OF MOUSE 
SARCOMAS AND SNAK® VENOM 


The results of the preliminary experiments with snake venoms reported here 
point to a striking similarity between the two phenomena. 

In both groups of experiments not only the mesonephroi (Figs. 9 and 10) but also 
other viscera—thyroid, metanephroi, spleen, and liver—are invaded by sympa- 
thetic fibers. In the experiments with snake venom as in those with tumor, nerve 
bundles perforated the wall of thoracic veins and projected inside the blood vessels 
(Figs. 11 and 12). Since the experiments with venom were not performed in as 
late developmental stages as those with tumor, the neuromas were of small size and 
comparable only to neuromas found in 9-10-day embryos bearing a mouse tumor. 
The quantitative data reported above are also in line with those given for tumor 
effects.” 4 

In both groups of experiments no morphological changes were detected in any 
part of the nervous system other than spinal and sympathetic ganglia. 


CONCLUDING REMARKS AND SUMMARY 


A nerve growth-promoting agent was discovered in snake venoms, and its in vitro 
and n vivo effects were investigated. 

In both sets of experiments the nerve fiber outgrowth from sensory and sympa- 
thetic ganglia was found to be far beyond the normal range. Furthermore, the 
experiments in vivo gave evidence of a remarkable increase in cell size and cell num- 
bers in sympathetic ganglia and of an acceleration of their differentiative processes. 
The nerve fibers produced in excess entered into the adjacent viscera. 

These observations show that the snake venoms duplicate the effects of a “pro- 
tein fraction” of mouse sarcomas in vitro and the effect of living sarcomas in vivo. 
The similarity of the two agents is further emphasized by the data which were ob- 
tained in attempts to purify them. In both instances, we are apparently dealing 
with a protein or a substance bound to protein.” * The purest fraction of snake 


venom obtained so far is approximately 1,000 times as effective as our purest frac- 
tion of mouse tumor (on a dry-weight basis). This fact as well as the ready avail- 
ability of different snake venoms will facilitate the further analysis of the phenomena 
reported above. 
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Fics. 1-4.—Microphotographs of silver-impregnated sensory ganglia, comparing the effect of 

the snake venom, of the intact tumor, of the cell-free extract of the same tumor, and a control 

ganglion. All ganglia were isolated from 7-day embryos and cultured for 24 hours. Fig. 1, 

sensory ganglion growing in a medium to which the fraction DEAE of moccasin venom had been 

added. Fig. 2, control ganglion. Fig. 3, ganglion combined with a fragment of sarcoma 37 (S). 
Fig. 4, ganglion growing in a medium to which cell-free extract of the tumor has been added. 





Fics. 5-12.—-Microphotographs of silver-impregnated control and experimental embryos. 
Fig. 5, prevertebral sympathetic ganglion of a 10-day embryo. Fig. 6, prevertebral sympathetic 
ganglion of a 10-day embryo injected with moccasin snake venom (fraction DEAE) for 4 days. 
Figs. 7 and 8, enlarged sectors of Figs. 5 and 6, respectively. Figs. 9 and 10, mesonephros of a 
control and of an injected 10-day embryo. Notice the svmpathetic nerve fibers between the 
mesonephric tubules in the injected embryo. Fig. 11, neuroma projecting into the lumen of a 
vein of a 9-day embryo with an extra-embryonic sarcoma 37. Fig. 12, same, in a 10-day embryo, 
injected with moccasin venom, fraction DEAE (see text). 
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Our failure, so far, to obtain positive effects after in vivo injection of mouse 
tumor fractions was attributed to the relative weakness of the tumor fraction 
rather than to the presence of two different agents in the tumor—one that would 
be responsible for the in vivo effects and one responsible for the in vitro effects. 
The present finding that the much stronger venom factor reproduces both effects 
in every detail gives further strong support to our contention. 


* This work has been supported by Grant B-463 from the National Institute of Neurological 
Diseases and Blindness of the National Institutes of Health, Public Health Service, by a grant 
of the American Cancer Society, recommended by the Committee on Growth, National Research 
Council, and by a contribution from an institutional grant of the American Cancer Society to 
Washington University. The able assistance of Miss Barbara Bankhead is gratefully acknowl- 
edged. 
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ON TIME LAGS IN EQUATIONS OF GROWTH 
By P. J. WANGERSKY AND W. J. CUNNINGHAM 


OSBORN ZOOLOGICAL LABORATORY; DUNHAM LABORATORY OF ELECTRICAL ENGINEERING, 


YALE UNIVERSITY, NEW HAVEN, CONNECTICUT 
Communicated by G. E. Hutchinson, July 27, 1956 


An earlier paper by one of the authors! analyzed the equation 


dN) _ bN.. |* ~ N ; >| (1) 
dt K A 


which has been suggested? as a mathematical description of a population growing 
at a constant reproductive rate (b) toward a saturation value (KX), with a self-regu- 
latory system represented by (K — N)/K which takes a finite time (7) te react to 
changes in the environment caused by the approach of the population to the satu- 
ration level. 

When this equation is used to describe real populations, a second time lag, al- 
though not expressed, is clearly implied. In the species of animals normally used 
for laboratory experiments in population dynamics a finite amount of time is needed 
for the animal to respond to changes in its environment. The lag introduced into 


the self-regulatory system in equation (1) becomes of great importance as (NV) ap- 
proaches (K). While the population is still in the exponential stage of its growth, 
and (N) is quite small in comparison to (K), another lag, the time needed for the 


reproductive process to take place, is more important. 
The equation as written states that the change in the number of animals in the 
time interval (dé) is some function of the number of animals present at time (é). 
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This function is a statistical estimate of the proportion of animals present at time 
(t) which were beginning the process of reproduction at some previous time such 
that they will give birth at time (¢ + dt). As long as the population is in the ex- 
ponential part of the growth curve, the time involved in the process of reproduction 
is sufficiently constant in a given species that it may be expressed as a constant lag, 
(71). Actually, even in this part of the growth curve there is considerable vari- 
ation between individuals in reproductive time lag. The results of this variation 
will be discussed later. The equation of growth can be written with this lag ex- 


pressed, rather than implied, as 


aNwy ; |= —Na- | 
DIV ge. 5 : . 
at K 


or, when simplified by combining constants, as 


dN 


ONG) — CNGHny Nu-ww; 
dt 


where c = b/K and 72 = reaction time lag. 

For this equation there are two equilibrium, or singular, values of N at which 
dN,.,/dt = 0. These values are (1) N = N; = Oand (2) N = N= b/c = K. In 
studying the solutions for equation (3), it is most convenient first to normalize the 


equation, using the definitions p = t/7, q = 12/71, and r N/No. In terms of 


these quantities, equation (3) becomes 


/ — ~ 
a Vile brilrip—1 os T(p—l(p—¢)], (4) 


where primes indicate differentiation with respect to (p). At singularity (2), 
(r) has the value unity. Near this singularity the solution may be studied by re- 
placing (r) with (1 + u«), where (wv) is a small quantity. The resulting variational 
equation is 


u(y) - bril—u 3 (5) 


where a product term is neglected. If (q) is small, it is a fair approximation to 
write 


= , WP ee > 
U(p—q) = Up) — QU wp) + */2q*u" (py), (6) 


which, combined with equation (5), gives 


2 l 
u” + {(, _ a)u + “| = 0. 
ge \bri 


Equation (7), approximately valid near singularity (2), indicates that, if q 
1/br;, or brz = 1, a steady-state oscillation will occur. The period of this oscillation 
in real time is T = 227,/(2/q?)”?2 = Qnro. That is, if values of (b) and (72) 
are large enough, the population will oscillate in a cyclic manner about the saturation 
value with a period approximately 4.4 times the reaction time lag. Furthermore, 
if [(2/q?) (1/bri — q)]* = 4(2/q?), or brz = +/2 — 1, critical damping exists. Popu- 
lations with slow rates of growth or very fast reaction times will approach the satu- 
ration value monotonically. Intermediate values of (b) and (72) give rise to damped 
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oscillation about the saturation value. This would seem to be the type of curve 
generally observed in laboratory growth experiments. Utida* found such a 
damped oscillation in populations of the azuki bean weevil. Gause’s* data on the 
growth of Paramecium aurelia and P. bursaria in separate cultures suggest this 
type of oscillation. Unfortunately, the cultures were not carried long enough or 
observed often enough to allow quantitative use of his data. 

Near singularity (1), the variational equation for the solution is 


, 
U (py) = bri[u p-l |, 
which is approximately equal to 


br; 


1 + bry 


u. (9) 


The solution here is a growing exponential curve, with the growth rate in terms 
of real time being b/(1 + bry). 

The result of this analysis is that near VN = N, = 0 the solution depends pri- 
marily upon the product (b7:).. The presence of the delay (7;) makes the initial 
growth rate less than (b), which it would be if (71) = 0, by the factor 1/(1 + b7;). 
Near N = N2 = K, the solution depends primarily on the product (b72). Thus, 
at the beginning of growth, while the population is increasing exponentially, the 
change in N depends largely on the reproductive time lag. As the population nears 
its saturation value, the governing factor becomes (72), the reaction lag. In many 
cases in nature we are dealing with populations near their equilibrium levels, and 
we may safely disregard the reproductive lag. However, in populations such as 
plankton and bacteria, which are repeatedly reduced to very small numbers by ex- 
haustion of the medium or by seasonal changes in physical factors, the reproductive 
lag may be more important than the reaction lag in describing the population. 

Since reproduction in animals is a discontinuous process, while the growth 


equations are continuous, the agreement between theory and experiment at the 


beginning of population growth can be no better than fair. The mathematician 
can construct fractional animals somewhat more easily than can the experimental- 
ist. For those species whose generations do not overlap, any continuous function 
can be only an approximate description and a working guide. In most species of 
animals, however, there is enough overlap of generations and enough individual 
variability in reproductive lag so that even a population started from one pair of 
animals will behave as a continuous function before very many generations have 
passed. 

The effect of reproductive lag on a population can probably be demonstrated 
most readily in extremely small populations and in the very beginning of population 
growth. Many investigators have found a definite lag in the growth rate of newly 
started populations and have started their mathematical descriptions at the end of 
the lag period. This lag has sometimes been attributed to the necessity for the 
animal to “condition” the medium and to other external causes. In the case of 
these animals living in a liquid medium, this might mean the removal of traces of 
metal ions by chelation or adjustment of the pH by the production of metabolic 
waste products. Some part of the lag must be internal, due to the reproductive 
lag discussed above. Differences in the physiological state of the animals when 
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they are added to the culture could account for differences in lag in separate ex- 
periments. It should be possible to test this in the laboratory. 

Oscillation about Ny may occur if (br2) is large enough. It is possible that the 
intrinsic oscillations in populations postulated by Slobodkin® are due at least in 
part to the time lags inherent in the physiology of the animals. When the lags 
are of sufficient size, such oscillations must occur but may be hidden by the effects 
of external events. 

The approximation for u,,—,) used in equation (6) is sufficiently poor that nu- 


merical values of (b) and (72) for critical damping and for steady-state oscillations 
found from equation (7) are somewhat in error, being too small. The differences 


here are quite similar to those found in studying equation (1) with (71) = 0, shown 
in an earlier paper. ! 

Of the three types of solutions found—the slow monotonic approach to satu- 
ration, the damped oscillation, and the limit cycle—the damped oscillation would 
seem to be the most advantageous to a species in an evolutionary sense. An oscil- 
lation of some sort, with continuous removal of a part of the population, would 
serve to keep the population structure from becoming overloaded with older mem- 
bers. The oscillation would serve somewhat the same purpose as the ‘jitter’ 
which is built into some mechanical systems to remove the necessity for over- 
coming friction when changing from one level of activity to another. If some sort 
of oscillation is indeed an advantage to the species, a damped oscillation with its 
faster recovery from disturbances would be far preferable to a cyclical one. In 
order for a limit cycle generated in this manner to be of small amplitude, very 
careful balancing of (b) and (72) is needed. The usual limit cycle approaches zero 


fairly closely and so would be particularly vulnerable to disturbances. 


The authors are greatly indebted to Professor G. E. Hutchinson and Dr. Gordon 
A. Riley for their many comments and suggestions. 
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ERRATUM: COURSE OF PRODUCTION OF AN ISOANTISERUM 
EFFECTING TUMOR HOMOGRAFT SURVIVAL IN MICE 
In the article of the above title appearing in these ProcEEDINGS, 42, 269-273 
1955, the following correction should be made: 
On page 270, column 2, line 4 of Table 1, for “20/20” read “20/22”. 


NATHAN KALISS 
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